Try our new research platform with insights from 80,000+ expert users

Fivetran vs Pentaho Data Integration and Analytics comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 19, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Fivetran
Ranking in Data Integration
13th
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
25
Ranking in other categories
Data Replication (3rd), Cloud Data Integration (7th)
Pentaho Data Integration an...
Ranking in Data Integration
24th
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
53
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of January 2025, in the Data Integration category, the mindshare of Fivetran is 2.3%, up from 1.9% compared to the previous year. The mindshare of Pentaho Data Integration and Analytics is 1.5%, up from 0.6% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration
 

Featured Reviews

Erik Jones - PeerSpot reviewer
Solution reduces time-to-value; high ROI
Fivetran has room for improvement in data pipeline observability. The Fivetran logs are fairly basic, compared to, for example, the insight Fivetran gives into helping users understanding the performance of data pipelines. So I think their observability into the pipeline itself could be improved. In addition, Fivetran is in the very early stages of allowing other companies to access its metadata API, but that's something that could use improvement, and I know that they're working on right now. We use a separate tool for "reverse ETL", which is the opposite of what Fivetran does; it pushes data from your data warehouse back out to business applications. If Fivetran pulls data from those same applications, they should also enable users to push it back. I would love to do both ETL and reverse ETL in the same tool. It would be nice if Fivetran offered both their regular offering plus the reverse ETL option as well.
Ryan Ferdon - PeerSpot reviewer
Low-code makes development faster than with Python, but there were caching issues
If you're working with a larger data set, I'm not so sure it would be the best solution. The larger things got the slower it was. It was kind of buggy sometimes. And when we ran the flow, it didn't go from a perceived start to end, node by node. Everything kicked off at once. That meant there were times when it would get ahead of itself and a job would fail. That was not because the job was wrong, but because Pentaho decided to go at everything at once, and something would process before it was supposed to. There were nodes you could add to make sure that, before this node kicks off, all these others have processed, but it was a bit tedious. There were also caching issues, and we had to write code to clear the cache every time we opened the program, because the cache would fill up and it wouldn't run. I don't know how hard that would be for them to fix, or if it was fixed in version 10. Also, the UI is a bit outdated, but I'm more of a fan of function over how something looks. One other thing that would have helped with Pentaho was documentation and support on the internet: how to do things, how to set up. I think there are some sites on how to install it, and Pentaho does have a help repository, but it wasn't always the most useful.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The product is very easy to use and very easy to configure."
"The product has some seamless connectors, which are readily available."
"Fivetran's most valuable feature is replication."
"SysTrack load is the best feature."
"Fivetran offers native connectivity with Sybase"
"Fivetran is remarkably easy to use; I haven't encountered any other data integration tool that is as user-friendly."
"The simplicity and scalability are the strongest features of Fivetran."
"The compare feature is the most valuable piece of it."
"We also haven't had to create any custom Java code. Almost everywhere it's SQL, so it's done in the pipeline and the configuration. That means you can offload the work to people who, while they are not less experienced, are less technical when it comes to logic."
"The solution has a free to use community version."
"This solution allows us to create pipelines using a minimal amount of custom coding."
"Its drag-and-drop interface lets me and my team implement all the solutions that we need in our company very quickly. It's a very good tool for that."
"It's very simple compared to other products out there."
"Sometimes, it took a whole team about two weeks to get all the data to prepare and present it. After the optimization of the data, it took about one to two hours to do the whole process. Therefore, it has helped a lot when you talk about money, because it doesn't take a whole team to do it, just one person to do one project at a time and run it when you want to run it. So, it has helped a lot on that side."
"Provides a good open source option."
"One of the most valuable features is the ability to create many API integrations. I'm always working with advertising agents and using Facebook and Instagram to do campaigns. We use Pentaho to get the results from these campaigns and to create dashboards to analyze the results."
 

Cons

"I would like to see an improvement in the support offered by Fivetran."
"An in-line data quality checking capability is missing"
"I would like for them to incorporate additional transformations. A valuable aspect of the product is that it does inflight transformations and that could be expanded."
"We use a separate tool for "reverse ETL", which is the opposite of what Fivetran does; it pushes data from your data warehouse back out to business applications. If Fivetran pulls data from those same applications, they should also enable users to push it back. I would love to do both ETL and reverse ETL in the same tool."
"The documentation can be laid out better to make it easier to find things, and I really wish there was built-in support for changing passwords. Some features don't work as advertised for the platform/repository database, and HVR is not always the fastest at getting results."
"The environment must be more development-friendly."
"The customization could improve because Fivetran gives more thought to people who don't want to manage analytics workflows rather than engineers who want to be able to customize pipelines more thoroughly."
"The interface needs to be more user-friendly."
"Should provide additional control for the data warehouse"
"The performance could be improved. If they could have analytics perform well on large volumes, that would be a big deal for our products."
"Parallel execution could be better in Pentaho. It's very simple but I don't think it works well."
"The web interface is rusty, and the biggest problem with Pentaho is debugging and troubleshooting. It isn't easy to build the pipeline incrementally. At least in our case, it's hard to find a way to execute step by step in the debugging mode."
"I work with different databases. I would like to work with more connectors to new databases, e.g., DynamoDB and MariaDB, and new cloud solutions, e.g., AWS, Azure, and GCP. If they had these connectors, that would be great. They could improve by building new connectors. If you have native connections to different databases, then you can make instructions more efficient and in a more natural way. You don't have to write any scripts to use that connector."
"The reporting definitely needs improvement. There are a lot of general, basic features that it doesn't have. A simple feature you would expect a reporting tool to have is the ability to search the repository for a report. It doesn't even have that capability. That's been a feature that we've been asking for since the beginning and it hasn't been implemented yet."
"It could be better integrated with programming languages, like Python and R. Right now, if I want to run a Python code on one of my ETLs, it is a bit difficult to do. It would be great if we have some modules where we could code directly in a Python language. We don't really have a way to run Python code natively."
"I have been facing some difficulties when working with large datasets. It seems that when there is a large amount of data, I experience memory errors."
 

Pricing and Cost Advice

"Fivetran is very expensive, and its database-driven pricing model is outdated."
"I've heard that the license for HVR is a bit costly compared to its competitors, but since it's reliable and efficient, I think the customer shouldn't be bothered about the cost."
"The solution is affordable."
"The licensing costs are extremely high for the usage of somebody who has one GB or two GB of usage per day for real-time traffic. There are many other players in the market which are similarly priced or competitively priced. On average per month, it used to come around 12,000-15,000 USD, which is very high."
"I don't have the exact information, but I know it is high, and it is on a yearly basis. There is no additional cost for what we're doing. We're always open to doing things cheaper, so we might potentially implement a different solution."
"The product is reasonably expensive"
"I would say they're a little bit on the expensive side, and their contract process is not particularly good, but there is a lot of potential flexibility."
"I rate the pricing a six out of ten."
"I mostly used the open-source version. I didn't work with a license."
"You don't need the Enterprise Edition, you can go with the Community Edition. That way you can use it for free and, for free, it's a pretty good tool to use."
"I think Lumada's price is fair compared to some of the others, like BusinessObjects, which is was the other thing that I used at my previous job. BusinessObject's price was more reasonable before SAP acquired it. They jacked the price up significantly. Oracle's OBIEE tool was also prohibitively expensive."
"I use it because it is free. I download from their page for free. I don't have to pay for a license. With other tools, I have to pay for the licenses. That is why I use Pentaho."
"The cost of these types of solutions are expensive. So, we really appreciate what we get for our money. Though, we don't think of the solution as a top-of-the-line solution or anything like that."
"I primarily work on the Community Version, which is available to use free of charge."
"There was a cost analysis done and Pentaho did favorably in terms of cost."
"It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are "almost" free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
831,265 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Educational Organization
30%
Computer Software Company
12%
Financial Services Firm
10%
Manufacturing Company
7%
Financial Services Firm
22%
Computer Software Company
14%
Government
7%
Comms Service Provider
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What's the deal with the HVR software acquisition?
As a user of HVR Software I followed this deal closely. Fivetran is apparently trying to establish more in its sector and by buying an already established data replication software, they become som...
Does HVR Software provide reliable insights?
I honestly can't think of another data replication software that can give you better statistics and insight than HVR Software. There's the feature for topology and statistics and both of them can ...
How much traffic can HVR Software handle?
As someone who works at a company where a high volume of information is replicated and has tried several data replication softwares, I can tell you that you're looking at the right one. HVR Softwar...
Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition : https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
 

Also Known As

No data available
Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
 

Overview

 

Sample Customers

DocuSign, Oldcastle Infrastructure, Crossmedia, Talkdesk, Chubbies, Brandwatch
66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
Find out what your peers are saying about Fivetran vs. Pentaho Data Integration and Analytics and other solutions. Updated: January 2025.
831,265 professionals have used our research since 2012.