Try our new research platform with insights from 80,000+ expert users

Denodo vs StreamSets comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 19, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Denodo
Ranking in Data Integration
9th
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
36
Ranking in other categories
Data Virtualization (1st), Cloud Data Integration (5th)
StreamSets
Ranking in Data Integration
15th
Average Rating
8.4
Reviews Sentiment
7.0
Number of Reviews
21
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of April 2025, in the Data Integration category, the mindshare of Denodo is 1.9%, down from 1.9% compared to the previous year. The mindshare of StreamSets is 1.6%, up from 1.3% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration
 

Featured Reviews

Vishal_Goyal - PeerSpot reviewer
The data catalog feature helps define data structures without storing data
The data catalog feature is valuable as it helps define data structures without storing data. Denodo's ability to connect to multiple data sources and perform extract-transform-load (ETL) operations on the fly is noteworthy. It supports API usage to call and present data in JSON or Tableau format without visualization capabilities.
Nantabo Jackie - PeerSpot reviewer
Simplified pipelines and helped us break down data silos within our organization
The design experience when implementing batch streaming or ECL pipelines is very easy and straightforward. When we initially attempted to integrate StreamSets with Kafka, it was somewhat challenging until we consulted the documentation, after which it became straightforward. We use StreamSets to move data into modern analytics platforms. Moving the data into modern analytics platforms is still complex. It requires a lot of understanding of logic. StreamSets enables us to build data pipelines without knowing how to code. StreamSets' ability to build data pipelines without requiring us to know complex programming is very important, as it allows us to focus on our projects without spending time writing code. StreamSets' Transformer for Snowflake is simple to use for designing both simple and complex transformation logic. StreamSets' Transformer for Snowflake is extremely important to me as it helps me to connect external data sources and keep my internal workflow organized. Transformer for Snowflake's functionality is a perfect ten out of ten. It is important and cost-effective that Transformer for Snowflake is a serverless engine embedded within the platform, as without this feature, it would be very expensive. This feature helps us to sell at lower budget costs, which would otherwise be at a high cost with other servers. StreamSets has helped improve our organization. StreamSets simplified pipelines for our organization. It is easier to complete a project when we know where and how to start, and working with the team remotely makes it more efficient. This helps us to save time and be more organized when creating data pipelines. Being a structured company that produces reliable resources for our application benefits both our clients and contacts. StreamSets' built-in data drift resilience plays a part in our ETL operations. With prior knowledge, the built-in data drift resilience is very effective, but it can be challenging to implement without the preexisting knowledge. The built-in data drift resilience reduced the time it takes us to fix data drift breakages by 45 percent. StreamSets helped us break down data silos within our organization. The use of StreamSets to break down data silos enabled us to be confident in the services and products we provide, as well as the real-time streaming we offer. This has had a positive impact on our business, as it allowed us to accurately determine the analytics we need to present to stakeholders, clients, and our sources while ensuring that the process is secure and transparent. StreamSets saved us time because anyone can use StreamSets not just developers. We can save around 40 percent of our time. StreamSets' reusable assets helped us reduce workload by around 25 percent. StreamSets saved us money by not having to hire developers with specialized skills. We saved around $2,000 US. StreamSets helped us scale our data operations. Since StreamSets makes it easy to scale our data operations, it enabled us to know exactly where to start at any time. We are aware of the timeline for completing the project, and depending on our familiarity with the software, we can come up with a solution quickly.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Denodo is very stable."
"Overall, I would rate Denodo a nine out of ten."
"The logical data warehouse functionality is fantastic. It truly stands out. The ClearOptimizer and Virtual Cache are great features. They work together seamlessly to optimize performance."
"Denodo makes it easy to export data as a service or data link to other services."
"In general, it's good for us to make tests so we can scout the data."
"Denodo's best features are its performance, easy data transformation, and the job scheduler."
"The best thing about Denodo is that creating and deploying a web service can be done in about 10 minutes, compared to a whole day when it comes to other solutions (such as when deploying with Java and AWS)."
"The most valuable features are data lineage and the concept of a semantic layer."
"The most valuable features are the option of integration with a variety of protocols, languages, and origins."
"For me, the most valuable features in StreamSets have to be the Data Collector and Control Hub, but especially the Data Collector. That feature is very elegant and seamlessly works with numerous source systems."
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"What I love the most is that StreamSets is very light. It's a containerized application. It's easy to use with Docker. If you are a large organization, it's very easy to use Kubernetes."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"I have used Data Collector, Transformer, and Control Hub products from StreamSets. What I really like about these products is that they're very user-friendly. People who are not from a technological or core development background find it easy to get started and build data pipelines and connect to the databases. They would be comfortable like any technical person within a couple of weeks."
"The ability to have a good bifurcation rate and fewer mistakes is valuable."
"The most valuable feature is the pipelines because they enable us to pull in and push out data from different sources and to manipulate and clean things up within them."
 

Cons

"Denodo needs better communication on how the product can be deployed for specific solutions, and there are price issues, as it's considered expensive."
"Denodo currently integrates with ChatGPT, but the ability to manage and utilize them directly within Denodo would be a significant improvement."
"Monitoring event logs can be improved. In the older version, there was a monitoring schedule to get event reports and properly audit the reports. In the newer version, it is not there, and we have to manually configure data and audit events."
"There are a couple of areas that can be improved in Denodo. From a stability point of view, sometimes we see issues in the data management functionality. This only happens now and then, however, and usually takes place when we add in our own customization."
"It requires improving the story of how it can solve specific problems."
"Denodo shows stability concerns due to its dependency on external environments, such as JVM."
"The solution is slow when there are many virtualization layers."
"I would like to see a proper way to avoid killing the sourcing systems."
"There aren't enough hands-on labs, and debugging is also an issue because it takes a lot of time. Logs are not that clear when you are debugging, and you can only select a single source for a pipeline."
"One area for improvement could be the cloud storage server speed, as we have faced some latency issues here and there."
"Visualization and monitoring need to be improved and refined."
"The documentation is inadequate and has room for improvement because the technical support does not regularly update their documentation or the knowledge base."
"They need to improve their customer care services. Sometimes it has taken more than 48 hours to resolve an issue. That should be reduced. They are aware of small or generic issues, but not the more technical or deep issues. For those, they require some time, generally 48 to 72 hours to respond. That should be improved."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
"The design experience is the bane of our existence because their documentation is not the best. Even when they update their software, they don't publish the best information on how to update and change your pipeline configuration to make it conform to current best practices. We don't pay for the added support. We use the "freeware version." The user community, as well as the documentation they provide for the standard user, are difficult, at best."
"One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infrastructure."
 

Pricing and Cost Advice

"Depending on the size of the client you want to work with, it can be prohibitively expensive at times."
"There is an express edition and a licensed enterprise edition."
"The licensing charges for this solution are payable annually."
"For us, the cost has been okay. Also, there are no additional costs; it's just the standard licensing fee."
"The cost for Denodo is in line with other similar products."
"I am not super familiar with the pricing, but so far, it seems good. We have been happy. We haven't seen any problems. The only time we had to pay extra was during the upgrade. We didn't upgrade at the time they told us to upgrade, and we had to pay extra to keep the service. They had stopped the support for the older version and moved to the newer version. It was not their fault. It was our fault because we didn't get on board quickly."
"Talking with my manager and others, nobody has complained about the pricing so far which is a positive sign."
"The licensing should be improved, as the cost per connector is quite expensive."
"It's not so favorable for small companies."
"The pricing is too fixed. It should be based on how much data you need to process. Some businesses are not so big that they process a lot of data."
"I believe the pricing is not equitable."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"There are two editions, Professional and Enterprise, and there is a free trial. We're using the Professional edition and it is competitively priced."
"The overall cost is very flexible so it is not a burden for our organization... However, the cost should be improved. For small and mid-size organizations it might be a challenge."
"The pricing is affordable for any business."
"StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
845,040 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
21%
Manufacturing Company
10%
Computer Software Company
10%
Government
7%
Financial Services Firm
14%
Computer Software Company
11%
Manufacturing Company
10%
Insurance Company
8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

Does Denodo provide useful data virtualization education? Is it useful to attend their training?
If you are a Denodo user, it makes sense to undergo their training. Different types of professionals can benefit from it, including administrators, developers, and architects. If you are keen on i...
In experience, what might Denodo be lacking or need improvement on?
I like Denodo a lot. It offers quick and easy web service deployment within minutes. There are not any flaws that I think make the product less good or effective. The only thing I can point out is...
Which industries can benefit from Denodo the most?
Denodo is suitable for pretty much all sectors that deal with: Big data Cloud solutions Data governance Logical data fabric Master data management In my opinion, organizations in different fields...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 

Comparisons

 

Overview

 

Sample Customers

Autodesk, VHA, AAA, Sumitomo Mitsui Trust Bank, Caterpillar, European Chemical Agency, Seagate, Nationwide, Time Warner Cable, Pantex, Inditex, BNSF Railways, Vodafone, CIT Group, Jazztel, Wolters Kluwer, Telefonica, TransAlta
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about Denodo vs. StreamSets and other solutions. Updated: February 2025.
845,040 professionals have used our research since 2012.