
Fivetran vs StreamSets comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Fivetran
Ranking in Data Integration: 13th
Average Rating: 8.0
Reviews Sentiment: 7.0
Number of Reviews: 25
Ranking in other categories: Data Replication (3rd), Cloud Data Integration (6th)

StreamSets
Ranking in Data Integration: 9th
Average Rating: 8.4
Reviews Sentiment: 7.5
Number of Reviews: 24
Ranking in other categories: none
 

Mindshare comparison

As of November 2024, in the Data Integration category, the mindshare of Fivetran is 2.2%, up from 2.0% the previous year. The mindshare of StreamSets is 1.7%, up from 1.3% the previous year. Mindshare is calculated based on PeerSpot user engagement data.
 

Featured Reviews

Erik Jones - PeerSpot reviewer
Solution reduces time-to-value; high ROI
Fivetran has room for improvement in data pipeline observability. The Fivetran logs are fairly basic compared to, for example, the insight Fivetran gives to help users understand the performance of data pipelines. So I think their observability into the pipeline itself could be improved. In addition, Fivetran is in the very early stages of allowing other companies to access its metadata API, but that's something that could use improvement, and I know that they're working on it right now. We use a separate tool for "reverse ETL", which is the opposite of what Fivetran does; it pushes data from your data warehouse back out to business applications. If Fivetran pulls data from those same applications, they should also enable users to push it back. I would love to do both ETL and reverse ETL in the same tool. It would be nice if Fivetran offered both their regular offering plus the reverse ETL option as well.
Reyansh Kumar - PeerSpot reviewer
We no longer need to hire highly skilled data engineers to create and monitor data pipelines
The things I like about StreamSets are its overall user interface, efficiency, and product features, which are all good. Also, the scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy. You just need to configure the data sources, the paths and their configurations, and you are ready to go. It is very efficient and very easy to use for ETL pipelines. It is a GUI-based interface in which you can easily create or design your own data pipelines with just a few clicks. As for moving data into modern analytics systems, we are using it with Microsoft Power BI, AWS, and some on-premises solutions, and it is very easy to get data from StreamSets into them. No hardcore coding or special technical expertise is required. It is also a no-code platform in which you can configure your data sources and data output for easy configuration of your data pipeline. This is a very important aspect because if a tool requires code development, we need to hire software developers to get the task done. By using StreamSets, it can be done with a few clicks.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"SysTrack load is the best feature."
"Fivetran can perform data migration incredibly fast, depending on the source and target."
"The portal is very intuitive and easy to use."
"The solution is stable. We've never faced any stability issues."
"The product is very easy to use and very easy to configure."
"The product has some seamless connectors, which are readily available."
"It is not like a traditional ETL, but it gives quite a lot of flexibility."
"Its arrays are powerful enough to handle migrations even when the replication is happening in the background, without causing any trouble with the ongoing traffic."
"The ability to have a good bifurcation rate and fewer mistakes is valuable."
"It's very easy to integrate. It integrates with Snowflake, AWS, Google Cloud, and Azure. It's very helpful for DevOps, DataOps, and data engineering because it provides a comprehensive solution, and it's not complicated."
"One of the things I like is the data pipelines. They have a very good design. Implementing pipelines is very straightforward. It doesn't require any technical skill."
"The most valuable feature is the pipelines because they enable us to pull in and push out data from different sources and to manipulate and clean things up within them."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"The ease of configuration for pipes is amazing. It has a lot of connectors. Mainly, we can do everything with the data in the pipe. I really like the graphical interface too."
"It is really easy to set up and the interface is easy to use."
"StreamSets Transformer is a good feature because it helps you when you are developing applications and when you don't want to write a lot of code. That is the best feature overall."
 

Cons

"This solution needs to improve its real-time data and transformation availability."
"The documentation can be laid out better to make it easier to find things, and I really wish there was built-in support for changing passwords. Some features don't work as advertised for the platform/repository database, and HVR is not always the fastest at getting results."
"There are not a lot of options to customize the pipeline, and the cost is high."
"It should have a few more monitoring functionalities."
"The documentation is decent, but it's hard to find information online about Fivetran. For example, if you try to search for an error code, you won't find much information about it in forums."
"There was a random change to our contract in a unilateral manner after the first year. The overall cost of using Fivetran was then unclear and this is the reason I would not recommend this solution."
"An in-line data quality checking capability is missing."
"The connections with SAP must be improved."
"They need to improve their customer care services. Sometimes it has taken more than 48 hours to resolve an issue. That should be reduced. They are aware of small or generic issues, but not the more technical or deep issues. For those, they require some time, generally 48 to 72 hours to respond. That should be improved."
"The execution engine could be improved. When I was at their session, they were using some obscure platform to run. There is a controller, which controls what happens on that, but you should be able to easily do this at any of the cloud services, such as Google Cloud. You shouldn't have any issues in terms of how to run it with their online development platform or design platform, basically their execution engine. There are issues with that."
"Sometimes, it is not clear at first how to set up nodes. A site with an explanation of how each node works would be very helpful."
"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingest the information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
"The software is very good overall. Areas for improvement are the error logging and the version history. I would like to see better, more detailed error logging information."
"One thing that I would like to add is the ability to manually enter data. The way the solution currently works is we don't have the option to manually change the data at any point in time. Being able to do that will allow us to do everything that we want to do with our data. Sometimes, we need to manually manipulate the data to make it more accurate in case our prior bifurcation filters are not good. If we have the option to manually enter the data or make the exact iterations on the data set, that would be a good thing."
"The data collector in StreamSets has to be designed properly. For example, a simple database configuration with MySQL DB requires the MySQL Connector to be installed."
"Using ETL pipelines is a bit complicated and requires some technical aid."
 

Pricing and Cost Advice

"I've heard that the license for HVR is a bit costly compared to its competitors, but since it's reliable and efficient, I think the customer shouldn't be bothered about the cost."
"Fivetran is very expensive, and its database-driven pricing model is outdated."
"When you have a lot of workflows and complex use cases, pricing goes down as you use it more."
"In the first year, we were given a very good discount. It was approximately 20,000 Euros per year. In the third year, we purchased credit for two years and the price was 33,000 Euros per year."
"The product is reasonably expensive."
"I can't give exact amounts because that's based on usage, but it's more expensive than some of its competitors."
"I rate the pricing a six out of ten."
"I don't have the exact information, but I know it is high, and it is on a yearly basis. There is no additional cost for what we're doing. We're always open to doing things cheaper, so we might potentially implement a different solution."
"We are running the community version right now, which can be used free of charge."
"There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"StreamSets is an expensive solution."
"We use the free version. It's great for a public, free release. Our stance is that the paid support model is too expensive to get into. They should honestly reevaluate that."
"The pricing is affordable for any business."
"It's not so favorable for small companies."
"StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
816,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Educational Organization: 28%
Computer Software Company: 12%
Financial Services Firm: 10%
Manufacturing Company: 8%

Financial Services Firm: 17%
Computer Software Company: 13%
Manufacturing Company: 8%
Insurance Company: 6%
 

 

Questions from the Community

What's the deal with the HVR software acquisition?
As a user of HVR Software I followed this deal closely. Fivetran is apparently trying to establish more in its sector and by buying an already established data replication software, they become som...
Does HVR Software provide reliable insights?
I honestly can't think of another data replication software that can give you better statistics and insight than HVR Software. There's the feature for topology and statistics and both of them can ...
How much traffic can HVR Software handle?
As someone who works at a company where a high volume of information is replicated and who has tried several data replication tools, I can tell you that you're looking at the right one. HVR Softwar...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 

Comparisons

 

 

Overview

 

Sample Customers

DocuSign, Oldcastle Infrastructure, Crossmedia, Talkdesk, Chubbies, Brandwatch
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about Fivetran vs. StreamSets and other solutions. Updated: November 2024.