
Palantir Foundry vs StreamSets comparison

 

Comparison Buyer's Guide

Executive Summary
Updated on Dec 19, 2024

 

Categories and Ranking

Palantir Foundry
Ranking in Data Integration: 19th
Average Rating: 7.6
Reviews Sentiment: 7.1
Number of Reviews: 16
Ranking in other categories: IT Operations Analytics (9th), Supply Chain Analytics (1st), Cloud Data Integration (14th), Data Migration Appliances (4th), Data Management Platforms (DMP) (2nd), Data and Analytics Service Providers (1st)

StreamSets
Ranking in Data Integration: 15th
Average Rating: 8.4
Reviews Sentiment: 7.0
Number of Reviews: 21
Ranking in other categories: None
 

Mindshare comparison

As of April 2025, in the Data Integration category, the mindshare of Palantir Foundry is 2.8%, up from 2.7% the previous year. The mindshare of StreamSets is 1.6%, up from 1.3% the previous year. Mindshare is calculated from PeerSpot user engagement data.
 

Featured Reviews

Rama Subba Reddy Thavva - PeerSpot reviewer
A low-code/no-code platform with a user-friendly UI
We couldn't implement or use some of the latest functionalities, like Spark. Palantir Foundry is scalable, but it is costly compared to other cloud providers. The solution is more suitable for small and medium businesses. It might be difficult for large enterprises. I rate the solution’s scalability a seven out of ten.
Karthik Rajamani - PeerSpot reviewer
Integrates with different enterprise systems and enables us to easily build data pipelines without knowing how to code
There are a few things that can be better. We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back. There are certain features that are only available at certain stages. For example, HTTP Client has some great features when it is used as a processor, but those features are not available in HTTP Client as a destination. There could be some improvements on the group side. Currently, if I want to know which users are a part of certain groups, it is not straightforward to see. You have to go to each and every user and check the groups he or she is a part of. They could improve it in that direction. Currently, we have to put in a manual effort. In case something goes wrong, we have to go to each and every user account to check whether he or she is a part of a certain group or not.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The security is also excellent. It's highly granular, so the admins have a high degree of control, and there are many levels of security. That worked well. You won't have an EDC unless you put everything onto the platform because it is its own isolated thing."
"Great features available in one tool."
"The ease of use is my favorite feature. We're able to build different models and projects or combine different projects to build one use case."
"I like the data onboarding to Palantir Foundry and ETL creation."
"It's scalable."
"Palantir Foundry is a robust platform that has really strong plugin connectors and provides features for real-time integration."
"The solution provides an end-to-end integrated tech stack that takes care of all utility/infrastructure topics for you."
"The solution offers very good end-to-end capabilities."
"The entire user interface is very simple and the simplicity of creating pipelines is something that I like very much about it. The design experience is very smooth."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"StreamSets is the leader in the market."
"Also, the intuitive canvas for designing all the streams in the pipeline, along with the simplicity of the entire product are very big pluses for me. The software is very simple and straightforward. That is something that is needed right now."
"The ability to have a good bifurcation rate and fewer mistakes is valuable."
"The most valuable feature is the pipelines because they enable us to pull in and push out data from different sources and to manipulate and clean things up within them."
"The Ease of configuration for pipes is amazing. It has a lot of connectors. Mainly, we can do everything with the data in the pipe. I really like the graphical interface too"
"What I love the most is that StreamSets is very light. It's a containerized application. It's easy to use with Docker. If you are a large organization, it's very easy to use Kubernetes."
 

Cons

"The data lineage was challenging. It's hard to track data from the sources as it moves through stages. Informatica EDC can easily capture and report it because it talks to the metadata. This is generated across those various staging points."
"If you want to create new models on specific data sets, computing that is quite costly."
"It requires a lot of manual work and is very time-consuming to get to a functional point."
"The frontend capabilities of Palantir Foundry could be improved."
"It would be helpful to build applications based on Azure functions or web apps in Palantir Foundry."
"The startup pricing is high, causing concern despite being cost-effective in terms of total cost of ownership."
"There is not a wide user base for the solution's online documentation so it is sometimes difficult to find answers."
"They do not have a data center in Europe, and we have lots of personally identifiable information in our dataset that needs to be hosted by a third-party data center like Amazon or Microsoft Azure."
"The documentation is inadequate and has room for improvement because the technical support does not regularly update their documentation or the knowledge base."
"Currently, we can only use the query to read data from SAP HANA. What we would like to see, as soon as possible, is the ability to read from multiple tables from SAP HANA. That would be a really good thing that we could use immediately. For example, if you have 100 tables in SQL Server or Oracle, then you could just point it to the schema or the 100 tables and ingestion information. However, you can't do that in SAP HANA since StreamSets currently is lacking in this. They do not have a multi-table feature for SAP HANA. Therefore, a multi-table origin for SAP HANA would be helpful."
"The monitoring visualization is not that user-friendly. It should include other features to visualize things, like how many records were streamed from a source to a destination on a particular date."
"The execution engine could be improved. When I was at their session, they were using some obscure platform to run. There is a controller, which controls what happens on that, but you should be able to easily do this at any of the cloud services, such as Google Cloud. You shouldn't have any issues in terms of how to run it with their online development platform or design platform, basically their execution engine. There are issues with that."
"The design experience is the bane of our existence because their documentation is not the best. Even when they update their software, they don't publish the best information on how to update and change your pipeline configuration to make it conform to current best practices. We don't pay for the added support. We use the "freeware version." The user community, as well as the documentation they provide for the standard user, are difficult, at best."
"One area for improvement could be the cloud storage server speed, as we have faced some latency issues here and there."
"One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infrastructure."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
 

Pricing and Cost Advice

"The solution’s pricing is high."
"Palantir Foundry has different pricing models that can be negotiated."
"Palantir Foundry is an expensive solution."
"It's expensive."
"We use the free version. It's great for a public, free release. Our stance is that the paid support model is too expensive to get into. They should honestly reevaluate that."
"It's not so favorable for small companies."
"The pricing is too fixed. It should be based on how much data you need to process. Some businesses are not so big that they process a lot of data."
"StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
"There are two editions, Professional and Enterprise, and there is a free trial. We're using the Professional edition and it is competitively priced."
"There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
"It has a CPU core-based licensing, which works for us and is quite good."
"We are running the community version right now, which can be used free of charge."
847,772 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Manufacturing Company: 14%
Computer Software Company: 11%
Financial Services Firm: 10%
Government: 7%

Financial Services Firm: 14%
Computer Software Company: 11%
Manufacturing Company: 9%
Insurance Company: 8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Palantir Foundry?
Palantir Foundry is a robust platform that has really strong plugin connectors and provides features for real-time integration.
What needs improvement with Palantir Foundry?
The solution’s data security could be improved. We cannot use many Python packages with the solution. We were able to use only a few compatible Python packages.
What is your primary use case for Palantir Foundry?
Our use cases are mostly related to data analytics. We are building some dashboards and ETL pipelines on the Palantir side. Palantir Foundry is a low-code/no-code platform with a user-friendly UI. ...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 

Overview

 

Sample Customers

Palantir Foundry: Merck KGaA, Airbus, Ferrari, United States Intelligence Community, United States Department of Defense
StreamSets: Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about Palantir Foundry vs. StreamSets and other solutions. Updated: April 2025.