Senior Data Platform Manager at a manufacturing company with 10,001+ employees
Real User
Top 5
Apr 10, 2024
StreamSets is used for data transformation rather than for full ETL. It focuses on transforming data directly from sources, without handling the extraction part of the process; the transformed data is then loaded into Amazon Redshift or other data warehousing solutions.
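As a rough illustration of the transform-then-load flow this reviewer describes (not the actual StreamSets pipeline), the pattern reduces to transforming records, staging them in S3, and bulk-loading into Redshift with COPY. In this Python sketch the bucket, role ARN, cluster endpoint, table, and field names are all hypothetical:

```python
import csv
import io

import boto3      # AWS SDK, used to stage the file in S3
import psycopg2   # Redshift speaks the PostgreSQL wire protocol

# Hypothetical names, for illustration only.
S3_BUCKET = "example-staging-bucket"
S3_KEY = "staging/orders_transformed.csv"
IAM_ROLE = "arn:aws:iam::123456789012:role/example-redshift-copy-role"

def transform(record: dict) -> dict:
    """Example in-flight transformation: normalize and derive fields."""
    return {
        "order_id": record["order_id"],
        "amount_usd": round(float(record["amount"]), 2),
        "region": record["region"].strip().upper(),
    }

def load_to_redshift(records: list[dict]) -> None:
    # Write the transformed records to an in-memory CSV and stage it in S3.
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["order_id", "amount_usd", "region"])
    writer.writeheader()
    for rec in records:
        writer.writerow(transform(rec))
    boto3.client("s3").put_object(Bucket=S3_BUCKET, Key=S3_KEY, Body=buf.getvalue())

    # COPY from S3 is the idiomatic bulk-load path into Redshift.
    conn = psycopg2.connect(
        host="example-cluster.abc123.us-east-1.redshift.amazonaws.com",
        port=5439, dbname="analytics", user="loader", password="...",
    )
    with conn, conn.cursor() as cur:
        cur.execute(
            f"COPY orders_transformed FROM 's3://{S3_BUCKET}/{S3_KEY}' "
            f"IAM_ROLE '{IAM_ROLE}' CSV IGNOREHEADER 1"
        )
```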
We were receiving data from hospitals and other healthcare service providers in the country; we operated predominantly in the US. When we received that data, we had to classify it into different repositories or datasets. The data was sent to different vendors, and for that, it needed to be processed in different ways. We needed to bifurcate the data at many steps with different kinds of filters. For that, we used StreamSets.
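The multi-step bifurcation described here is what StreamSets models with routing stages such as the Stream Selector. A minimal Python sketch of the idea follows; the record fields and vendor rules are invented for illustration:

```python
from typing import Callable

# Hypothetical routing rules: each vendor dataset is defined by a predicate.
ROUTES: dict[str, Callable[[dict], bool]] = {
    "claims_vendor":  lambda r: r.get("record_type") == "claim",
    "labs_vendor":    lambda r: r.get("record_type") == "lab_result",
    "billing_vendor": lambda r: r.get("record_type") == "invoice"
                                and r.get("country") == "US",
}

def route(records: list[dict]) -> dict[str, list[dict]]:
    """Bifurcate incoming records into per-vendor datasets; the first
    matching filter wins, and unmatched records land in a review queue."""
    buckets: dict[str, list[dict]] = {name: [] for name in ROUTES}
    buckets["unmatched"] = []
    for rec in records:
        for name, predicate in ROUTES.items():
            if predicate(rec):
                buckets[name].append(rec)
                break
        else:
            buckets["unmatched"].append(rec)
    return buckets
```

Chaining several such stages, each with its own filters, reproduces the "bifurcate at many steps" behavior the reviewer mentions.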
We are sharing data between platforms. It helps me stay independent of ETL tools and work with the data formats without using any programming language.
We use it for building a data lake. In our context, we have sales multiple times during the day, and a sale is the trigger; the sales data uses the lake as a landing zone. We also use it for various types of data transformation.
I use StreamSets to develop data feeds for different value streams, to control scheduling options for my data pipelines, and for internal version control.
Chief software engineer at Appnomu Business Services
Real User
Top 10
Mar 24, 2023
In our department, we use StreamSets to design data pipelines that load data from various RDBMS sources to the cloud, such as Azure. We also use it so that data analysts can generate dashboards for our organization, as well as for real-time use cases such as monitoring and consuming streaming data. Additionally, we are able to customize StreamSets to suit our needs and budget.
The main use case of StreamSets is to work on data integration and ingesting data for DataOps and modern analytics. We also use it for integrating data files from multiple sources. We use it to build, monitor, and manage smart, continuous data pipelines.
My primary use case with StreamSets is to integrate large data sets from multiple sources into a destination. We also use it as a platform to ingest data and deliver data for database analytics.
Product Marketer at a media company with 1,001-5,000 employees
Real User
Top 5
Jan 6, 2023
Our major use case with StreamSets is to build data pipelines from multiple sources to multiple destinations. We mainly use the StreamSets Data Collector Engine for seamless streaming from any source to any destination. We also use it to deliver continuous data for database operations and modern analytics.
We are working on a very large data analytics project in which we are integrating large data sets into a platform from multiple sources. We need to create data pipelines, and we are using StreamSets for all the data integration activities: creating the pipelines, monitoring them, and running all the data processes smoothly.
I worked mostly on data ingestion use cases when I was using Data Collector. Later on, I got involved with some Spark-based transformations using Transformer. Currently, we are not using CI/CD or automated deployments; we deploy to production manually, but going forward, we plan to use CI/CD for automated deployments. I have worked on both on-prem and cloud deployments. The current implementation is on-prem, but in my previous project we worked on an AWS-based implementation, and we did a small PoC with GCP as well.
StreamSets is a wonderful data engineering and DataOps tool with which we can design and create data pipelines, loading on-prem data to the cloud. One of our major projects was to move data from on-premises to Azure and GCP. Once the data is loaded, the data scientist and data analyst teams use it to generate patterns and insights. For a US healthcare service provider company, we designed a StreamSets pipeline to connect to relational database sources; we generated the schema from the source data and loaded it into Azure Data Lake Storage (ADLS) or another cloud store, like S3 or GCP. This was one of our batch use cases. With StreamSets, we have also solved real-time streaming use cases, where we streamed data from a source Kafka topic to Azure Event Hubs. This was a trigger-based streaming pipeline, which moved data when it appeared in the Kafka topic; since it was a streaming pipeline, it continuously streamed data from Kafka to Azure for further analysis.
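The Kafka-to-Event-Hubs leg of that streaming use case can be sketched in plain Python with the kafka-python and azure-eventhub libraries. This is an illustration of the forwarding loop, not the reviewer's actual pipeline; the topic, broker, and connection string are placeholders:

```python
from kafka import KafkaConsumer  # pip install kafka-python
from azure.eventhub import EventHubProducerClient, EventData  # pip install azure-eventhub

# Placeholder endpoints, for illustration.
consumer = KafkaConsumer(
    "source-topic",
    bootstrap_servers=["kafka-broker:9092"],
    group_id="eventhub-forwarder",
    auto_offset_reset="earliest",
)
producer = EventHubProducerClient.from_connection_string(
    conn_str="Endpoint=sb://example.servicebus.windows.net/;...",
    eventhub_name="target-hub",
)

# Continuously forward: each Kafka message becomes an Event Hubs event,
# mirroring the trigger-based pipeline described above. Production code
# would accumulate several messages per batch before sending.
for message in consumer:
    batch = producer.create_batch()
    batch.add(EventData(message.value))
    producer.send_batch(batch)
```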
Senior Technical Manager at a financial services firm with 501-1,000 employees
Real User
Aug 8, 2018
It performs very well. The main use is to extract information from some of our Kafka topics and put it into our internal systems and flat files, with integration via Java.
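This reviewer's setup is Java-based; purely as an illustration of the extract-to-flat-file pattern, here is the equivalent loop in Python with kafka-python. The topic, broker, and output path are hypothetical:

```python
import json

from kafka import KafkaConsumer  # pip install kafka-python

# Placeholder topic and broker.
consumer = KafkaConsumer("orders", bootstrap_servers=["kafka-broker:9092"])

# Append each Kafka record to a flat file as one JSON line, the kind of
# extraction-to-file step described above.
with open("/data/exports/orders.jsonl", "a", encoding="utf-8") as out:
    for message in consumer:
        record = json.loads(message.value)
        out.write(json.dumps(record) + "\n")
```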
StreamSets is a data integration platform that enables organizations to efficiently move and process data across various systems. It offers a user-friendly interface for designing, deploying, and managing data pipelines, allowing users to easily connect to various data sources and destinations. StreamSets also provides real-time monitoring and alerting capabilities, ensuring that data is flowing smoothly and any issues are quickly addressed.
We are using StreamSets to migrate our on-premises data to the cloud.
Our company builds products mainly for healthcare divisions and we use StreamSets for all our data engineering tasks.
We use the whole Data Collector application.
We are using the StreamSets DataOps Platform to ingest data into a data lake.
We typically use it to transport our Oracle raw datasets up to Microsoft Azure, and then into SQL databases there.
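That Oracle-to-Azure-SQL transport reduces to a chunked read-and-insert loop. A minimal Python sketch, assuming the python-oracledb and pyodbc drivers; the connection details and table names are invented:

```python
import oracledb  # pip install oracledb (python-oracledb)
import pyodbc    # pip install pyodbc; requires an ODBC driver for SQL Server

# Placeholder connection details, for illustration.
src = oracledb.connect(user="etl", password="...", dsn="onprem-db:1521/ORCLPDB1")
dst = pyodbc.connect(
    "DRIVER={ODBC Driver 18 for SQL Server};"
    "SERVER=example.database.windows.net;DATABASE=staging;UID=etl;PWD=..."
)
dst_cur = dst.cursor()
dst_cur.fast_executemany = True  # bulk parameter binding for the inserts

with src.cursor() as src_cur:
    src_cur.execute("SELECT id, name, amount FROM raw_orders")
    while True:
        rows = src_cur.fetchmany(5000)  # stream in chunks, not all at once
        if not rows:
            break
        dst_cur.executemany(
            "INSERT INTO dbo.raw_orders (id, name, amount) VALUES (?, ?, ?)",
            rows,
        )
        dst.commit()
```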