Azure Data Factory vs Pentaho Data Integration and Analytics comparison

 

Comparison Buyer's Guide

Executive Summary
Updated on Dec 19, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

Sentiment score: 7.1
Azure Data Factory automates data processes, cuts costs, and improves efficiency, offering potential ROI of 20%-30% over five years.
Sentiment score: 7.9
Pentaho offers cost-effective integration, reducing ETL time, lowering expenses, and enhancing competitiveness with open-source flexibility and efficiency.
 

Customer Service

Sentiment score: 6.5
Azure Data Factory support is responsive and helpful, though response times and efficiency vary based on issue complexity and support package.
Sentiment score: 5.2
Users rely on community support over customer service due to mixed experiences, despite responsive technical support and Hitachi's involvement.
The technical support is responsive and helpful
The technical support from Microsoft is rated an eight out of ten.
Communication with the vendor is challenging
 

Scalability Issues

Sentiment score: 7.5
Azure Data Factory is scalable and flexible for enterprises, though some concerns about costs and external integration exist.
Sentiment score: 7.3
Pentaho excels in scalability and efficient data handling but faces challenges with exceptionally large data and complex growth scenarios.
Azure Data Factory is highly scalable.
Pentaho Data Integration handles larger datasets better.
 

Stability Issues

Sentiment score: 7.8
Azure Data Factory is reliable, scoring 8-9/10, though some report occasional glitches or slowdowns under heavy loads.
Sentiment score: 7.1
Pentaho Data Integration offers reliability for small to midsize operations but may lag and freeze with complex uses.
The solution has a high level of stability, roughly a nine out of ten.
It's pretty stable; however, it struggles when dealing with smaller amounts of data.
 

Room For Improvement

Azure Data Factory needs better integration, simplified UI, improved connectivity, enhanced support, clearer pricing, and performance scalability for large data.
Pentaho needs improvements in big data performance, error handling, UI, scheduling, backward compatibility, cloud integration, and Python support.
Sometimes, the compute fails to process data if there is a heavy load suddenly, and it doesn't scale up automatically.
Incorporating more dedicated API sources to specific services like HubSpot CRM or Salesforce would be beneficial.
Pentaho Data Integration is very friendly, but it is not very useful when there isn't a lot of data to handle.
 

Setup Cost

Azure Data Factory has complex pricing, with costs varying widely due to usage, data volume, and additional services.
Pentaho offers a cost-effective solution with its free Community Edition and affordable subscription-based Enterprise Edition for varying needs.
The pricing is cost-effective.
It is considered cost-effective.
 

Valuable Features

Azure Data Factory provides scalable data transformation, integration, and orchestration with user-friendly interface and extensive connector support.
Pentaho provides an intuitive, open-source platform for efficient ETL development and data integration with minimal coding and broad compatibility.
It connects to different sources out-of-the-box, making integration much easier.
The interface of Azure Data Factory is very usable with a more interactive visual experience, making it easier for people who are not as experienced in coding to work with.
It's easy to use and friendly, especially for larger data sets.
 

Categories and Ranking

Azure Data Factory
Ranking in Data Integration: 1st
Average Rating: 8.0
Reviews Sentiment: 6.9
Number of Reviews: 89
Ranking in other categories: Cloud Data Warehouse (3rd)

Pentaho Data Integration an...
Ranking in Data Integration: 24th
Average Rating: 8.0
Reviews Sentiment: 6.9
Number of Reviews: 53
Ranking in other categories: No ranking in other categories
 

Mindshare comparison

As of January 2025, in the Data Integration category, the mindshare of Azure Data Factory is 10.8%, down from 13.4% the previous year. The mindshare of Pentaho Data Integration and Analytics is 1.5%, up from 0.6% the previous year. Mindshare is calculated based on PeerSpot user engagement data.
 

Featured Reviews

Joy Maitra - PeerSpot reviewer
Facilitates seamless data pipeline creation with good analytics and thorough monitoring
Azure Data Factory is a low-code, no-code platform, which is helpful. It provides many prebuilt functionalities that assist in building data pipelines. It also facilitates easy transformation with all the functionalities required for analytics. Furthermore, it connects to different sources out-of-the-box, making integration much easier. The monitoring is very thorough, though a more readable version would be appreciated.
Ryan Ferdon - PeerSpot reviewer
Low-code makes development faster than with Python, but there were caching issues
If you're working with a larger data set, I'm not so sure it would be the best solution. The larger things got, the slower it was. It was kind of buggy sometimes. And when we ran the flow, it didn't go from a perceived start to end, node by node. Everything kicked off at once. That meant there were times when it would get ahead of itself and a job would fail. That was not because the job was wrong, but because Pentaho decided to go at everything at once, and something would process before it was supposed to. There were nodes you could add to make sure that, before this node kicks off, all these others have processed, but it was a bit tedious. There were also caching issues, and we had to write code to clear the cache every time we opened the program, because the cache would fill up and it wouldn't run. I don't know how hard that would be for them to fix, or if it was fixed in version 10. Also, the UI is a bit outdated, but I'm more of a fan of function over how something looks. One other thing that would have helped with Pentaho was documentation and support on the internet: how to do things, how to set up. I think there are some sites on how to install it, and Pentaho does have a help repository, but it wasn't always the most useful.
831,265 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 13%
Computer Software Company: 12%
Manufacturing Company: 9%
Healthcare Company: 7%

Financial Services Firm: 22%
Computer Software Company: 14%
Government: 7%
Comms Service Provider: 5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

How do you select the right cloud ETL tool?
AWS Glue and Azure Data Factory are the best-performing cloud services for ELT.
How does Azure Data Factory compare with Informatica PowerCenter?
Azure Data Factory is flexible, modular, and works well. In terms of cost, it is not too pricey. It offers the stability and reliability I am looking for, good scalability, and is easy to set up an...
How does Azure Data Factory compare with Informatica Cloud Data Integration?
Azure Data Factory is a solid product offering many transformation functions. It has pre-load and post-load transformations, allowing users to apply transformations either in code by using Power Q...
Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes, here is the feature comparison between the community and enterprise editions: https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
 

Also Known As

Azure Data Factory: No data available
Pentaho Data Integration and Analytics: Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
 

Overview

 

Sample Customers

1. Adobe 2. BMW 3. Coca-Cola 4. General Electric 5. Johnson & Johnson 6. LinkedIn 7. Mastercard 8. Nestle 9. Pfizer 10. Samsung 11. Siemens 12. Toyota 13. Unilever 14. Verizon 15. Walmart 16. Accenture 17. American Express 18. AT&T 19. Bank of America 20. Cisco 21. Deloitte 22. ExxonMobil 23. Ford 24. General Motors 25. IBM 26. JPMorgan Chase 27. Microsoft (Azure Data Factory is developed by Microsoft) 28. Oracle 29. Procter & Gamble 30. Salesforce 31. Shell 32. Visa
66Controls, Providential Revenue Agency of Río Negro, NOAA Information Systems, Swiss Real Estate Institute
Find out what your peers are saying about Azure Data Factory vs. Pentaho Data Integration and Analytics and other solutions. Updated: January 2025.