Try our new research platform with insights from 80,000+ expert users

Fivetran vs StreamSets comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 19, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Fivetran
Ranking in Data Integration
13th
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
25
Ranking in other categories
Data Replication (3rd), Cloud Data Integration (7th)
StreamSets
Ranking in Data Integration
10th
Average Rating
8.4
Reviews Sentiment
7.1
Number of Reviews
20
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of February 2025, in the Data Integration category, the mindshare of Fivetran is 2.2%, up from 1.9% compared to the previous year. The mindshare of StreamSets is 1.6%, up from 1.2% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration
 

Featured Reviews

Erik Jones - PeerSpot reviewer
Solution reduces time-to-value; high ROI
Fivetran has room for improvement in data pipeline observability. The Fivetran logs are fairly basic, compared to, for example, the insight Fivetran gives into helping users understanding the performance of data pipelines. So I think their observability into the pipeline itself could be improved. In addition, Fivetran is in the very early stages of allowing other companies to access its metadata API, but that's something that could use improvement, and I know that they're working on right now. We use a separate tool for "reverse ETL", which is the opposite of what Fivetran does; it pushes data from your data warehouse back out to business applications. If Fivetran pulls data from those same applications, they should also enable users to push it back. I would love to do both ETL and reverse ETL in the same tool. It would be nice if Fivetran offered both their regular offering plus the reverse ETL option as well.
Reyansh Kumar - PeerSpot reviewer
We no longer need to hire highly skilled data engineers to create and monitor data pipelines
The things I like about StreamSets are its * overall user interface * efficiency * product features, which are all good. Also, the scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy. You just need to configure the data sources, the paths and their configurations, and you are ready to go. It is very efficient and very easy to use for ETL pipelines. It is a GUI-based interface in which you can easily create or design your own data pipelines with just a few clicks. As for moving data into modern analytics systems, we are using it with Microsoft Power BI, AWS, and some on-premises solutions, and it is very easy to get data from StreamSets into them. No hardcore coding or special technical expertise is required. It is also a no-code platform in which you can configure your data sources and data output for easy configuration of your data pipeline. This is a very important aspect because if a tool requires code development, we need to hire software developers to get the task done. By using StreamSets, it can be done with a few clicks.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature of Fivetran is that it only synchronizes what needs to be synchronized."
"The user interface is mostly UI-based, eliminating the need for a lot of coding."
"The ease of setting up the connectors and transformations is highly valuable."
"The simplicity and scalability are the strongest features of Fivetran."
"Making the decision to implement Fivetran was supported by the fact that they have better connectors than other competitors."
"The product is very easy to use and very easy to configure."
"The compare feature is the most valuable piece of it."
"There's the general feature of the platform where it just makes it very easy to integrate different things, but I would say a specific difference is their integration of DBT,."
"The ability to have a good bifurcation rate and fewer mistakes is valuable."
"The best feature that I really like is the integration."
"The entire user interface is very simple and the simplicity of creating pipelines is something that I like very much about it. The design experience is very smooth."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"The most valuable feature is the pipelines because they enable us to pull in and push out data from different sources and to manipulate and clean things up within them."
"The Ease of configuration for pipes is amazing. It has a lot of connectors. Mainly, we can do everything with the data in the pipe. I really like the graphical interface too"
"What I love the most is that StreamSets is very light. It's a containerized application. It's easy to use with Docker. If you are a large organization, it's very easy to use Kubernetes."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
 

Cons

"There is not a lot of options to customize the pipeline, and the cost is high."
"We experience cost issues because Fivetran is charged on a usage basis. When you reach a certain level, the tool should focus on reducing the costs. The solution is expensive when you are moving gigabytes and petabytes of data. It should also focus more on REST APIs and webhooks."
"The documentation is decent, but it's hard to find information online about Fivetran. For example, if you try to search for an error code, you won't find much information about it in forums."
"Some of the pain points we're looking at are trying to integrate some of the items in the Microsoft stack, so SharePoint and Excel, and then some of the newer Azure services."
"The documentation can be laid out better to make it easier to find things, and I really wish there was built-in support for changing passwords. Some features don't work as advertised for the platform/repository database, and HVR is not always the fastest at getting results."
"It should have a few more monitoring functionalities."
"This solution needs to improve its real-time data and transformation availability."
"We use a separate tool for "reverse ETL", which is the opposite of what Fivetran does; it pushes data from your data warehouse back out to business applications. If Fivetran pulls data from those same applications, they should also enable users to push it back. I would love to do both ETL and reverse ETL in the same tool."
"The logging mechanism could be improved. If I am working on a pipeline, then create a job out of it and it is running, it will generate constant logs. So, the logging mechanism could be simplified. Now, it is a bit difficult to understand and filter the logs. It takes some time."
"StreamSets should provide a mechanism to be able to perform data quality assessment when the data is being moved from one source to the target."
"The execution engine could be improved. When I was at their session, they were using some obscure platform to run. There is a controller, which controls what happens on that, but you should be able to easily do this at any of the cloud services, such as Google Cloud. You shouldn't have any issues in terms of how to run it with their online development platform or design platform, basically their execution engine. There are issues with that."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
"If you use JDBC Lookup, for example, it generally takes a long time to process data."
"There aren't enough hands-on labs, and debugging is also an issue because it takes a lot of time. Logs are not that clear when you are debugging, and you can only select a single source for a pipeline."
"The documentation is inadequate and has room for improvement because the technical support does not regularly update their documentation or the knowledge base."
"In terms of the product, I don't think there is any room for improvement because it is very good. One small area of improvement that is very much needed is on the knowledge base side. Sometimes, it is not very clear how to set up a certain process or a certain node for a person who's using the platform for the first time."
 

Pricing and Cost Advice

"In the first year, we were given a very good discount. It was approximately 20,000 Euros per year. In the third year, we purchased credit for two years and the price was 33,000 Euros per year."
"When you have a lot of workflows and complex use cases, pricing goes down as you use it more."
"I don't have the exact information, but I know it is high, and it is on a yearly basis. There is no additional cost for what we're doing. We're always open to doing things cheaper, so we might potentially implement a different solution."
"The pricing model is okay and mid to large companies will not have an issue with it."
"I can't give exact amounts because that's based on usage, but it's more expensive than some of its competitors."
"I rate the pricing a six out of ten."
"I would say they're a little bit on the expensive side, and their contract process is not particularly good, but there is a lot of potential flexibility."
"The solution is affordable."
"We use the free version. It's great for a public, free release. Our stance is that the paid support model is too expensive to get into. They should honestly reevaluate that."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"There are two editions, Professional and Enterprise, and there is a free trial. We're using the Professional edition and it is competitively priced."
"The pricing is too fixed. It should be based on how much data you need to process. Some businesses are not so big that they process a lot of data."
"StreamSets is an expensive solution."
"It has a CPU core-based licensing, which works for us and is quite good."
"It's not so favorable for small companies."
"StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
838,713 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Educational Organization
30%
Computer Software Company
12%
Financial Services Firm
10%
Manufacturing Company
7%
Financial Services Firm
17%
Computer Software Company
11%
Manufacturing Company
10%
Insurance Company
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What's the deal with the HVR software acquisition?
As a user of HVR Software I followed this deal closely. Fivetran is apparently trying to establish more in its sector and by buying an already established data replication software, they become som...
Does HVR Software provide reliable insights?
I honestly can't think of another data replication software that can give you better statistics and insight than HVR Software. There's the feature for topology and statistics and both of them can ...
How much traffic can HVR Software handle?
As someone who works at a company where a high volume of information is replicated and has tried several data replication softwares, I can tell you that you're looking at the right one. HVR Softwar...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 

Comparisons

 

Overview

 

Sample Customers

Autodesk, Condé Nast, JetBlue, Morgan Stanley, OpenAI, LVMH, Pfizer, Verizon, SpotifyNational Australia Bank, Saks, Cemex, Okta, Dropbox, Pitney Bowes, World Fuel Services,Lufthansa, AutoZone, ASICS, ASOS, Coupa, Databricks, Hermes, New Relic, Intercom,Canva, Honeywell, Square, DocuSign, Nandos, Oldcastle Infrastructure
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about Fivetran vs. StreamSets and other solutions. Updated: February 2025.
838,713 professionals have used our research since 2012.