Try our new research platform with insights from 80,000+ expert users

Matillion ETL vs Pentaho Data Integration and Analytics comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Matillion ETL
Average Rating
8.4
Number of Reviews
26
Ranking in other categories
Cloud Data Integration (5th)
Pentaho Data Integration an...
Average Rating
8.0
Number of Reviews
51
Ranking in other categories
Data Integration (30th)
 

Featured Reviews

AntonHaupt - PeerSpot reviewer
Jan 23, 2024
Efficient data integration and transformation with seamless cloud-native integration
In our small business unit, we currently have around four users, with two of them utilizing Matillion within our organization. Considering our growing needs, we're contemplating transitioning to an enterprise SaaS solution where we would share the same instance. Currently, each user is billed individually, but consolidating to a shared instance seems more efficient. Scalability is excellent when using the SaaS solution, easily reaching a rating of ten out of ten. Each data pipeline request is encapsulated within a Docker container and spun off, allowing for instant scalability. Overall, I would rate it a nine out of ten in terms of performance and scalability.
Ryan Ferdon - PeerSpot reviewer
Mar 24, 2022
Low-code makes development faster than with Python, but there were caching issues
If you're working with a larger data set, I'm not so sure it would be the best solution. The larger things got the slower it was. It was kind of buggy sometimes. And when we ran the flow, it didn't go from a perceived start to end, node by node. Everything kicked off at once. That meant there were times when it would get ahead of itself and a job would fail. That was not because the job was wrong, but because Pentaho decided to go at everything at once, and something would process before it was supposed to. There were nodes you could add to make sure that, before this node kicks off, all these others have processed, but it was a bit tedious. There were also caching issues, and we had to write code to clear the cache every time we opened the program, because the cache would fill up and it wouldn't run. I don't know how hard that would be for them to fix, or if it was fixed in version 10. Also, the UI is a bit outdated, but I'm more of a fan of function over how something looks. One other thing that would have helped with Pentaho was documentation and support on the internet: how to do things, how to set up. I think there are some sites on how to install it, and Pentaho does have a help repository, but it wasn't always the most useful.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"It has helped us to get onto the cloud quickly."
"The most valuable feature of Matillion ETL is the UI experience in which you can drag and drop most of the transformation."
"It is pretty user-friendly, even for people who aren't super technical."
"The most valuable feature of Matillion ETL is its ease of use. If you have had some experience with other solutions, such as Snowflake, the use of this solution will be simple."
"The most valuable feature of Matillion ETL is its user-friendly graphical interface."
"Matillion ETL has great Git integration that is perfect and convenient to use."
"The simplicity of this tool is nice. It has a good graphical user interface. You can also do a lot of generic stuff in the tool. If there is good connectivity to a cloud database, such as Snowflake, and you can have a lot of Snowflake functionality in the tool."
"It can scale to a great extent. It can handle the load that we are putting on it, which is about 5TBs."
"Flexible deployment, in any environment, is very important to us. That is the key reason why we ended up with these tools. Because we have a very highly secure environment, we must be able to install it in multiple environments on multiple different servers. The fact that we could use the same tool in all our environments, on-prem and in the cloud, was very important to us."
"The way it has improved our product is by giving our users the ability to do ad hoc reports, which is very important to our users. We can do predictive analysis on trends coming in for contracts, which is what our product does. The product helps users decide which way to go based on the predictive analysis done by Pentaho. Pentaho is not doing predictions, but reporting on the predictions that our product is doing. This is a big part of our product."
"We're using the PDI and the repository function, and they give us the ability to easily generate reporting and output, and to access data. We also like the ability to schedule."
"Its drag-and-drop interface lets me and my team implement all the solutions that we need in our company very quickly. It's a very good tool for that."
"The solution has a free to use community version."
"We use Lumada’s ability to develop and deploy data pipeline templates once and reuse them. This is very important. When the entire pipeline is automated, we do not have any issues in respect to deployment of code or with code working in one environment but not working in another environment. We have saved a lot of time and effort from that perspective because it is easy to build ETL pipelines."
"It has a really friendly user interface, which is its main feature. The process of automating or combining SQL code with some databases and doing the automation is great and really convenient."
"Provides a good open source option."
 

Cons

"The cost of the solution is high and could be reduced."
"The product must enhance its near-real-time data capture feature."
"The current version is a bit more limited because it's on a virtual machine, and everything executes on that one virtual machine."
"In the next release, we would like to have connections to more databases."
"The improvement area could be possible if the tool provides better integration capabilities with other ecosystems, including governance tools or data cataloging tools, as it is currently an area where the solution is lacking."
"While the UI is good, it could be improved in its efficiency and made easier to use."
"The product's scalability needs improvement. Perhaps adding more connectors would be beneficial."
"I am looking forward to seeing the expansion of the source range for their data loader product."
"Larger data jobs take more time to execute."
"​There is not a data quality or MDM solution in the Pentaho DI suite.​"
"Although it is a low-code solution with a graphical interface, often the error messages that you get are of the type that a developer would be happy with. You get a big stack of red text and Java errors displayed on the screen, and less technical people can get intimidated by that. It can be a bit intimidating to get a wall of red error messages displayed. Other graphical tools that are focused at the power user level provide a much more user-friendly experience in dealing with your exceptions and guiding the user into where they've made the mistake."
"​I work with the Community Edition, therefore I do not have support. There was an issue that I could not resolve with community support.​"
"I would like to see more improvements with AS400 DB2."
"If you're working with a larger data set, I'm not so sure it would be the best solution. The larger things got the slower it was."
"I have been facing some difficulties when working with large datasets. It seems that when there is a large amount of data, I experience memory errors."
"As far as I remember, not all connectors worked very well. They can add more connectors and more drivers to the process to integrate with more flows."
 

Pricing and Cost Advice

"Purchasing it through the AWS Marketplace is pretty convenient. There is a little bit of back and forth in terms of the licensing based on the machine size, but it seems to have worked out well. it is convenient to have it all as part of our AWS billing."
"The solution is very cheap. You're paying $2.50 an hour and if you set your service up, which you can do, you're not getting charged. Currently, our ETL process is just an overnight process that runs for about an hour. I can start and stop my server just for an hour if I want to and spent $2.50 a day for an ETL solution. There are no additional costs."
"The pricing depends on what edition the customer opts for. For example, the standard edition is priced at $2.00 per credit. And you are only charged when you use it. You're not charged when it's idle."
"The cost of the solution is high and could be reduced."
"The price of Matillion ETL is reasonable."
"It was very easy to purchase through the AWS Marketplace, but it was also expensive."
"Matillion ETL is expensive."
"It is not necessarily a cheap solution. However, it's reasonable priced, especially with the smaller machines that we run it on."
"We did a two or three-year deal the last time we did it. As compared to other solutions, at least so far in our experience, it has been very affordable. The licensing is by component. So, you need to make sure you only license the components that you really intend to use. I am not sure if we have relicensed after the Hitachi acquisition, but previously, multi-year renewals resulted in a good discount. I'm not sure if this is still the case. We've had the full suite for a lot of years, and there is just the initial cost. I am not aware of any additional costs."
"When we first started with it, it was much cheaper. It has gone up drastically, especially since Hitachi bought out Pentaho."
"If a company is looking for an ETL solution and wants to integrate it with their tech stack but doesn't want to spend a bunch of money, Pentaho is a good solution"
"It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are "almost" free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
"The pricing has been pretty good. I'm used to using everything open-source or freeware-based. I understand that organizations need to make sure that the solutions are secure, and that's basically where I hit a roadblock in my current organization. They needed to ensure that we had a license and we had a secure way of accessing it so that no outside parties could get access to our data, but in terms of pricing, considering how much other teams are spending on cloud solutions or even their existing solutions, its price point is pretty good. At this time, there are no additional costs. We just have the licensing fees."
"I think Lumada's price is fair compared to some of the others, like BusinessObjects, which is was the other thing that I used at my previous job. BusinessObject's price was more reasonable before SAP acquired it. They jacked the price up significantly. Oracle's OBIEE tool was also prohibitively expensive."
"Sometimes we provide the licenses or the customer can procure their own licenses. Previously, we had an enterprise license. Currently, we are on a community license as this is adequate for our needs."
"There was a cost analysis done and Pentaho did favorably in terms of cost."
report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
815,854 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
16%
Computer Software Company
14%
Manufacturing Company
9%
Government
6%
Financial Services Firm
23%
Computer Software Company
14%
Government
7%
Comms Service Provider
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Matillion ETL?
The new version with the Productivity Cloud is very simple. It's easy to use, navigate, and understand.
What is your experience regarding pricing and costs for Matillion ETL?
The solution's pricing is not based on the licensing cost but on the running hours when the Matillion instance is up and running. Its pricing model is different from the traditional pricing models ...
What needs improvement with Matillion ETL?
Depending on the use case, the solution's pricing could be improved. Matillion ETL should include more enhanced capabilities for extracting data from the SAP systems.
Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition : https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
 

Also Known As

Matillion ETL for Redshift, Matillion ETL for Snowflake, Matillion ETL for BigQuery
Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
 

Overview

 

Sample Customers

Thrive Market, MarketBot, PWC, Axtria, Field Nation, GE, Superdry, Quantcast, Lightbox, EDF Energy, Finn Air, IPRO, Twist, Penn National Gaming Inc
66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
Find out what your peers are saying about Matillion ETL vs. Pentaho Data Integration and Analytics and other solutions. Updated: October 2024.
815,854 professionals have used our research since 2012.