Try our new research platform with insights from 80,000+ expert users

Palantir Foundry vs Pentaho Data Integration and Analytics comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 19, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Palantir Foundry
Ranking in Data Integration
19th
Average Rating
7.6
Reviews Sentiment
7.1
Number of Reviews
15
Ranking in other categories
IT Operations Analytics (9th), Supply Chain Analytics (1st), Cloud Data Integration (14th), Data Migration Appliances (4th), Data Management Platforms (DMP) (2nd), Data and Analytics Service Providers (1st)
Pentaho Data Integration an...
Ranking in Data Integration
22nd
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
53
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of March 2025, in the Data Integration category, the mindshare of Palantir Foundry is 2.7%, up from 2.6% compared to the previous year. The mindshare of Pentaho Data Integration and Analytics is 1.4%, up from 0.5% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration
 

Featured Reviews

Rama Subba Reddy Thavva - PeerSpot reviewer
A low-code/no-code platform with a user-friendly UI
We couldn't implement or use some of the latest functionalities, like Spark. Palantir Foundry is scalable, but it is costly compared to other cloud providers. The solution is more suitable for small and medium businesses. It might be difficult for large enterprises. I rate the solution’s scalability a seven out of ten.
Ryan Ferdon - PeerSpot reviewer
Low-code makes development faster than with Python, but there were caching issues
If you're working with a larger data set, I'm not so sure it would be the best solution. The larger things got the slower it was. It was kind of buggy sometimes. And when we ran the flow, it didn't go from a perceived start to end, node by node. Everything kicked off at once. That meant there were times when it would get ahead of itself and a job would fail. That was not because the job was wrong, but because Pentaho decided to go at everything at once, and something would process before it was supposed to. There were nodes you could add to make sure that, before this node kicks off, all these others have processed, but it was a bit tedious. There were also caching issues, and we had to write code to clear the cache every time we opened the program, because the cache would fill up and it wouldn't run. I don't know how hard that would be for them to fix, or if it was fixed in version 10. Also, the UI is a bit outdated, but I'm more of a fan of function over how something looks. One other thing that would have helped with Pentaho was documentation and support on the internet: how to do things, how to set up. I think there are some sites on how to install it, and Pentaho does have a help repository, but it wasn't always the most useful.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The security is also excellent. It's highly granular, so the admins have a high degree of control, and there are many levels of security. That worked well. You won't have an EDC unless you put everything onto the platform because it is its own isolated thing."
"I like the data onboarding to Palantir Foundry and ETL creation."
"Great features available in one tool."
"It's scalable."
"The ease of use is my favorite feature. We're able to build different models and projects or combine different projects to build one use case."
"The interface is really user-friendly."
"The solution offers very good end-to-end capabilities."
"The solution provides an end-to-end integrated tech stack that takes care of all utility/infrastructure topics for you."
"It's very simple compared to other products out there."
"I can create faster instructions than writing with SQL or code. Also, I am able to do some background control of the data process with this tool. Therefore, I use it as an ELT tool. I have a station area where I can work with all the information that I have in my production databases, then I can work with the data that I created."
"I can use Python, which is open-source, and I can run other scripts, including Linux scripts. It's user-friendly for running any object-based language. That's a very important feature because we live in a world of open-source."
"The amount of data that it loads and processes is good."
"The fact that it's a low-code solution is valuable. It's good for more junior people who may not be as experienced with programming."
"It's my understanding that the product can scale."
"The fact that it enables us to leverage metadata to automate data pipeline templates and reuse them is definitely one of the features that we like the best. The metadata injection is helpful because it reduces the need to create and maintain additional ETLs. If we didn't have that feature, we would have lots of duplicated ETLs that we would have to create and maintain. The data pipeline templates have definitely been helpful when looking at productivity and costs."
"The abstraction is quite good."
 

Cons

"The solution could use more online documentation for new users."
"The data lineage was challenging. It's hard to track data from the sources as it moves through stages. Informatica EDC can easily capture and report it because it talks to the metadata. This is generated across those various staging points."
"The solution’s data security could be improved."
"It requires a lot of manual work and is very time-consuming to get to a functional point."
"There is not a wide user base for the solution's online documentation so it is sometimes difficult to find answers."
"Some error messages can be very cryptic."
"It would be helpful to build applications based on Azure functions or web apps in Palantir Foundry."
"Cost of this solution is quite high."
"The performance could be improved. If they could have analytics perform well on large volumes, that would be a big deal for our products."
"If you develop it on MacBook, it'll be quite a hassle."
"It could be better integrated with programming languages, like Python and R. Right now, if I want to run a Python code on one of my ETLs, it is a bit difficult to do. It would be great if we have some modules where we could code directly in a Python language. We don't really have a way to run Python code natively."
"​I could not connect to our Hadoop environment in an easy and flexible way, and it was important to scale our data warehouse​."
"I would like to see support for some additional cloud sources. It doesn't support Azure, for example. I was trying to do a PoC with Azure the other day but it seems they don't support it."
"I experience difficulties when handling millions of rows, as the data movement from one source to another becomes challenging."
"The web interface is rusty, and the biggest problem with Pentaho is debugging and troubleshooting. It isn't easy to build the pipeline incrementally. At least in our case, it's hard to find a way to execute step by step in the debugging mode."
"​I work with the Community Edition, therefore I do not have support. There was an issue that I could not resolve with community support.​"
 

Pricing and Cost Advice

"Palantir Foundry is an expensive solution."
"The solution’s pricing is high."
"It's expensive."
"Palantir Foundry has different pricing models that can be negotiated."
"There was a cost analysis done and Pentaho did favorably in terms of cost."
"There is a good open source option (Community Edition)​."
"The pricing has been pretty good. I'm used to using everything open-source or freeware-based. I understand that organizations need to make sure that the solutions are secure, and that's basically where I hit a roadblock in my current organization. They needed to ensure that we had a license and we had a secure way of accessing it so that no outside parties could get access to our data, but in terms of pricing, considering how much other teams are spending on cloud solutions or even their existing solutions, its price point is pretty good. At this time, there are no additional costs. We just have the licensing fees."
"It does seem a bit expensive compared to the serverless product offering. Tools, such as Server Integration Services, are "almost" free with a database engine. It is comparable to products like Alteryx, which is also very expensive."
"We did a two or three-year deal the last time we did it. As compared to other solutions, at least so far in our experience, it has been very affordable. The licensing is by component. So, you need to make sure you only license the components that you really intend to use. I am not sure if we have relicensed after the Hitachi acquisition, but previously, multi-year renewals resulted in a good discount. I'm not sure if this is still the case. We've had the full suite for a lot of years, and there is just the initial cost. I am not aware of any additional costs."
"Sometimes we provide the licenses or the customer can procure their own licenses. Previously, we had an enterprise license. Currently, we are on a community license as this is adequate for our needs."
"For most development tasks, the Enterprise edition should be sufficient. It depends on the type of support that you require for your production environment."
"I believe the pricing of the solution is more affordable than the competitors"
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
839,422 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Manufacturing Company
14%
Financial Services Firm
11%
Computer Software Company
10%
Government
7%
Financial Services Firm
21%
Computer Software Company
15%
Government
8%
Comms Service Provider
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Palantir Foundry?
Palantir Foundry is a robust platform that has really strong plugin connectors and provides features for real-time integration.
What needs improvement with Palantir Foundry?
The solution’s data security could be improved. We cannot use many Python packages with the solution. We were able to use only a few compatible Python packages.
What is your primary use case for Palantir Foundry?
Our use cases are mostly related to data analytics. We are building some dashboards and ETL pipelines on the Palantir side. Palantir Foundry is a low-code/no-code platform with a user-friendly UI. ...
Which ETL tool would you recommend to populate data from OLTP to OLAP?
Hi Rajneesh, yes here is the feature comparison between the community and enterprise edition : https://www.hitachivantara.com/en-us/pdf/brochure/leverage-open-source-benefits-with-assurance-of-hita...
What do you think can be improved with Hitachi Lumada Data Integrations?
In my opinion, the reporting side of this tool needs serious improvements. In my previous company, we worked with Hitachi Lumada Data Integration and while it does a good job for what it’s worth, ...
What do you use Hitachi Lumada Data Integrations for most frequently?
My company has used this product to transform data from databases, CSV files, and flat files. It really does a good job. We were most satisfied with the results in terms of how many people could us...
 

Also Known As

No data available
Hitachi Lumada Data Integration, Kettle, Pentaho Data Integration
 

Overview

 

Sample Customers

Merck KGaA, Airbus, Ferrari,United States Intelligence Community, United States Department of Defense
66Controls, Providential Revenue Agency of Ro Negro, NOAA Information Systems, Swiss Real Estate Institute
Find out what your peers are saying about Palantir Foundry vs. Pentaho Data Integration and Analytics and other solutions. Updated: February 2025.
839,422 professionals have used our research since 2012.