Pentaho Data Integration and Analytics pros and cons

4.0 out of 5
1,177 followers

Pros & Cons summary

Prominent pros & cons

PROS

Pentaho Data Integration is praised for its ease of use, especially in transforming data.
It offers a valuable feature called metadata injection, reducing the need for additional ETLs.
The tool supports flexible deployment across various environments, which is crucial for secure setups.
Users appreciate the ability to leverage Pentaho for effective data integration and reporting from multiple data sources.
Its use of open-source components allows for easy integration with other scripts, enhancing automation capabilities.

CONS

Users experience integration difficulties with Azure and seek improvements for seamless cloud deployments.
Feedback highlights a need for native connectors to new databases and cloud platforms like DynamoDB and MariaDB.
There are performance issues with large datasets, causing memory errors and slow processing.
Users express a desire for better support and integration with programming languages like Python and R.
Quality and stability during major releases are a concern, with reports of breaking basic functionalities.
 

Pentaho Data Integration and Analytics Pros review quotes

PR
Dec 13, 2021
The graphical nature of the development interface is most useful because we've got people with quite mixed skills in the team. We've got some very junior, apprentice-level people, and we've got support analysts who don't have an IT background. It allows us to have quite complicated data flows and embed logic in them. Rather than having to trawl through lines and lines of code and try to work out what it's doing, you get a visual representation, which makes it quite easy for people with mixed skills to support and maintain the product. That's one side of it.
DP
Dec 14, 2021
Lumada has allowed us to interact with our employees more effectively and compensate them properly. One of the cool things is that we use it to generate commissions for our salespeople and bonuses for our warehouse people. It allows us to get information out to them in a timely fashion. We can also see where they're at and how they're doing.
Ryan Ferdon - PeerSpot reviewer
Mar 24, 2022
The fact that it's a low-code solution is valuable. It's good for more junior people who may not be as experienced with programming.
Learn what your peers think about Pentaho Data Integration and Analytics. Get advice and tips from experienced pros sharing their opinions. Updated: December 2024.
824,067 professionals have used our research since 2012.
reviewer1751571 - PeerSpot reviewer
Dec 22, 2021
The fact that it enables us to leverage metadata to automate data pipeline templates and reuse them is definitely one of the features that we like the best. The metadata injection is helpful because it reduces the need to create and maintain additional ETLs. If we didn't have that feature, we would have lots of duplicated ETLs that we would have to create and maintain. The data pipeline templates have definitely been helpful when looking at productivity and costs.
Dale Bloom - PeerSpot reviewer
Jan 20, 2022
I absolutely love Hitachi. I'm one of the forefront supporters of Hitachi for my firm. It's so easy to integrate within our environments. In terms of being able to quickly build ETL jobs, transform, and then automate them, it's really easy to integrate throughout for data analytics.
ES
Sep 4, 2022
We also haven't had to create any custom Java code. Almost everywhere it's SQL, so it's done in the pipeline and the configuration. That means you can offload the work to people who, while they are not less experienced, are less technical when it comes to logic.
VK
May 25, 2022
We use Lumada’s ability to develop and deploy data pipeline templates once and reuse them. This is very important. When the entire pipeline is automated, we do not have any issues in respect to deployment of code or with code working in one environment but not working in another environment. We have saved a lot of time and effort from that perspective because it is easy to build ETL pipelines.
José Orlando Maia - PeerSpot reviewer
Apr 11, 2022
The area where Lumada has helped us is in the commercial area. There are many extractions to compose reports about our sales team performance and production steps. Since we are using Lumada to gather data from each industry in each country, we can get data from Argentina, Chile, Brazil, and Colombia at the same time. We can then concentrate and consolidate it in only one place, like our data warehouse. This improves our production performance and meets our need for information about the industry, production data, and commercial data.
SK
Jan 6, 2022
Flexible deployment, in any environment, is very important to us. That is the key reason why we ended up with these tools. Because we have a very highly secure environment, we must be able to install it in multiple environments on multiple different servers. The fact that we could use the same tool in all our environments, on-prem and in the cloud, was very important to us.
Ridwan Saeful Rohman - PeerSpot reviewer
Jul 4, 2024
The abstraction is quite good.
 

Pentaho Data Integration and Analytics Cons review quotes

PR
Dec 13, 2021
Although it is a low-code solution with a graphical interface, often the error messages that you get are of the type that a developer would be happy with. You get a big stack of red text and Java errors displayed on the screen, and less technical people can get intimidated by that. It can be a bit intimidating to get a wall of red error messages displayed. Other graphical tools that are focused at the power user level provide a much more user-friendly experience in dealing with your exceptions and guiding the user into where they've made the mistake.
DP
Dec 14, 2021
Some of the scheduling features about Lumada drive me buggy. The one issue that always drives me up the wall is when Daylight Savings Time changes. It doesn't take that into account elegantly. Every time it changes, I have to do something. It's not a big deal, but it's annoying.
Ryan Ferdon - PeerSpot reviewer
Mar 24, 2022
If you're working with a larger data set, I'm not so sure it would be the best solution. The larger things got the slower it was.
reviewer1751571 - PeerSpot reviewer
Dec 22, 2021
The reporting definitely needs improvement. There are a lot of general, basic features that it doesn't have. A simple feature you would expect a reporting tool to have is the ability to search the repository for a report. It doesn't even have that capability. That's been a feature that we've been asking for since the beginning and it hasn't been implemented yet.
Dale Bloom - PeerSpot reviewer
Jan 20, 2022
In the Community edition, it would be nice to have more modules that allow you to code directly within the application. It could have R or Python completely integrated into it, but this could also be because I'm using an older version.
ES
Sep 4, 2022
The support for the Enterprise Edition is okay, but what they have done in the last three or four years is move more and more things to that edition. The result is that they are breaking the Community Edition. That's what our impression is.
VK
May 25, 2022
It could be better integrated with programming languages, like Python and R. Right now, if I want to run Python code in one of my ETLs, it is a bit difficult to do. It would be great if we had some modules where we could code directly in Python. We don't really have a way to run Python code natively.
José Orlando Maia - PeerSpot reviewer
Apr 11, 2022
Lumada could have more native connectors with other vendors, such as Google BigQuery, Microsoft OneDrive, Jira systems, and Facebook or Instagram. We would like to gather data from modern platforms using Lumada, which is a better approach. As a comparison, if you open Power BI to retrieve data, then you can get data from many vendors with cloud-native connectors, such as Azure, AWS, Google BigQuery, and Athena Redshift. Lumada should have more native connectors to help us and facilitate our job in gathering information from these new modern infrastructures and tools.
SK
Jan 6, 2022
The testing and quality could really improve. Every time that there is a major release, we are very nervous about what is going to get broken. We have had a lot of experience with that, as even the latest one was broken. Some basic things get broken. That doesn't look good for Hitachi at all. If there is one place I would advise them to spend some money and do some effort, it is with the quality. It is not that hard to start putting in some unit tests so basic things don't get broken when they do a new release. That just looks horrible, especially for an organization like Hitachi.
Ridwan Saeful Rohman - PeerSpot reviewer
Jul 4, 2024
If you develop it on a MacBook, it'll be quite a hassle.