Try our new research platform with insights from 80,000+ expert users

SAP Data Services vs StreamSets comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

SAP Data Services
Ranking in Data Integration
10th
Average Rating
8.0
Number of Reviews
48
Ranking in other categories
Data Quality (2nd)
StreamSets
Ranking in Data Integration
9th
Average Rating
8.4
Reviews Sentiment
7.5
Number of Reviews
24
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of November 2024, in the Data Integration category, the mindshare of SAP Data Services is 2.9%, down from 3.4% compared to the previous year. The mindshare of StreamSets is 1.7%, up from 1.3% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Integration
 

Featured Reviews

RambabuMandala - PeerSpot reviewer
Responsive support, scalable, and beneficial integration
The BA reporting tools, such as Data Services, and ETL tool in SAP Data Services are the most valuable. When we had in-memory requirements, we used HANA. HANA is most preferably for most the customers for in-memory. SAP is the first company that created the in-memory concept. There are a lot of SAP applications and they have good integration with SAP packages. In the next release, they should be more advanced cloud functionalities and web services. More integration with other applications including SAPs and other applications. They should not switch to a complete solution but provide better integration with other applications.
Reyansh Kumar - PeerSpot reviewer
We no longer need to hire highly skilled data engineers to create and monitor data pipelines
The things I like about StreamSets are its * overall user interface * efficiency * product features, which are all good. Also, the scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy. You just need to configure the data sources, the paths and their configurations, and you are ready to go. It is very efficient and very easy to use for ETL pipelines. It is a GUI-based interface in which you can easily create or design your own data pipelines with just a few clicks. As for moving data into modern analytics systems, we are using it with Microsoft Power BI, AWS, and some on-premises solutions, and it is very easy to get data from StreamSets into them. No hardcore coding or special technical expertise is required. It is also a no-code platform in which you can configure your data sources and data output for easy configuration of your data pipeline. This is a very important aspect because if a tool requires code development, we need to hire software developers to get the task done. By using StreamSets, it can be done with a few clicks.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"It is a powerful product with a broad range of features."
"The solution offers very good integration capabilities."
"Data Services' table comparison mechanism is very powerful. It's pretty hard to find a similar feature in other solutions."
"The most valuable feature of SAP Data Services is the integration with data sources."
"It's easy to understand and deploy. It's easy to create new applications, and depending on the complexity of the application, it is easy to deploy the new requirements."
"The logic is also simple. It makes it easy to build your extraction."
"The maintenance of data services is the solution's most valuable feature."
"The solution has good connectivity with SAP."
"StreamSets’ data drift resilience has reduced the time it takes us to fix data drift breakages. For example, in our previous Hadoop scenario, when we were creating the Sqoop-based processes to move data from source to destinations, we were getting the job done. That took approximately an hour to an hour and a half when we did it with Hadoop. However, with the StreamSets, since it works on a data collector-based mechanism, it completes the same process in 15 minutes of time. Therefore, it has saved us around 45 minutes per data pipeline or table that we migrate. Thus, it reduced the data transfer, including the drift part, by 45 minutes."
"One of the things I like is the data pipelines. They have a very good design. Implementing pipelines is very straightforward. It doesn't require any technical skill."
"It's very easy to integrate. It integrates with Snowflake, AWS, Google Cloud, and Azure. It's very helpful for DevOps, DataOps, and data engineering because it provides a comprehensive solution, and it's not complicated."
"I really appreciate the numerous ready connectors available on both the source and target sides, the support for various media file formats, and the ease of configuring and managing pipelines centrally."
"The most valuable would be the GUI platform that I saw. I first saw it at a special session that StreamSets provided towards the end of the summer. I saw the way you set it up and how you have different processes going on with your data. The design experience seemed to be pretty straightforward to me in terms of how you drag and drop these nodes and connect them with arrows."
"StreamSets data drift feature gives us an alert upfront so we know that the data can be ingested. Whatever the schema or data type changes, it lands automatically into the data lake without any intervention from us, but then that information is crucial to fix for downstream pipelines, which process the data into models, like Tableau and Power BI models. This is actually very useful for us. We are already seeing benefits. Our pipelines used to break when there were data drift changes, then we needed to spend about a week fixing it. Right now, we are saving one to two weeks. Though, it depends on the complexity of the pipeline, we are definitely seeing a lot of time being saved."
"The ETL capabilities are very useful for us. We extract and transform data from multiple data sources, into a single, consistent data store, and then we put it in our systems. We typically use it to connect our Apache Kafka with data lakes. That process is smooth and saves us a lot of time in our production systems."
"Important features include that it comprises lots of functionality to connect data from various sources through connector availability, scheduling pipelines at any time, and integration with third-party and security solutions for encryption."
 

Cons

"The solution could improve the overall features. There is a lot that can be done in the solution, therefor there are areas where it can improve. Additionally, there is a need to make it easier for one to create connections to other non-SAP systems. The flexibility to connect to other non-SAP systems is needed."
"The execution engines and processing engines have shortcomings and need improvements."
"The solution should improve connectivity with more source systems and target systems."
"Scheduling of jobs in the sequence is not available. It's a very tedious process to do that particular task. If there was a tool to schedule in the sequence that would be extremely helpful."
"The description of error messages isn't extensive, although they point to the problem. With other solutions, like Talend, I was able to use the debugger to get directly to the problem, but with SAP Data Services the debugger is not working. I'm not sure if it's a problem with the version specifically, but I'm using it in an enterprise environment and I can't do an upgrade."
"Address validation in SAP Data Services is chargeable and should be made free."
"We would like the different tools offered within this solution to be available as standalone entities, rather than having to purchase the entire package when we only require a few features."
"Source code control is another headache. When your source code base gets too large, managing the source code becomes cumbersome."
"One thing that I would like to add is the ability to manually enter data. The way the solution currently works is we don't have the option to manually change the data at any point in time. Being able to do that will allow us to do everything that we want to do with our data. Sometimes, we need to manually manipulate the data to make it more accurate in case our prior bifurcation filters are not good. If we have the option to manually enter the data or make the exact iterations on the data set, that would be a good thing."
"I would like to see it integrate with other kinds of platforms, other than Java. We're going to have a lot of applications using .NET and other languages or frameworks. StreamSets is very helpful for the old Java platform but it's hard to integrate with the other platforms and frameworks."
"In terms of the product, I don't think there is any room for improvement because it is very good. One small area of improvement that is very much needed is on the knowledge base side. Sometimes, it is not very clear how to set up a certain process or a certain node for a person who's using the platform for the first time."
"If you use JDBC Lookup, for example, it generally takes a long time to process data."
"Sometimes, it is not clear at first how to set up nodes. A site with an explanation of how each node works would be very helpful."
"The software is very good overall. Areas for improvement are the error logging and the version history. I would like to see better, more detailed error logging information."
"The monitoring visualization is not that user-friendly. It should include other features to visualize things, like how many records were streamed from a source to a destination on a particular date."
"The data collector in StreamSets has to be designed properly. For example, a simple database configuration with MySQL DB requires the MySQL Connector to be installed."
 

Pricing and Cost Advice

"Speaking about prices, Oracle and SAP are market leaders. So, the prices are more."
"The product’s on-premise version is expensive for a medium-sized company."
"There is a one time purchase fee plus annual maintenance."
"The price is very reasonable."
"I rate the product price as one or two on a scale of one to ten, where one is a low price, and ten is a high price."
"The price of SAP Data Services is a little high when compared to other solutions. Due to the pricing and capabilities, many clients are switching to other solutions."
"At the entry level, if you're using it only for data integration, it is a little bit cheap. However, for large organizations it can be expensive. There are additional costs apart from the license."
"It's relatively expensive compared to other vendors, but it is competitive."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"We are running the community version right now, which can be used free of charge."
"The overall cost is very flexible so it is not a burden for our organization... However, the cost should be improved. For small and mid-size organizations it might be a challenge."
"There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
"StreamSets is an expensive solution."
"StreamSets Data Collector is open source. One can utilize the StreamSets Data Collector, but the Control Hub is the main repository where all the jobs are present. Everything happens in Control Hub."
"The licensing is expensive, and there are other costs involved too. I know from using the software that you have to buy new features whenever there are new updates, which I don't really like. But initially, it was very good."
"There are two editions, Professional and Enterprise, and there is a free trial. We're using the Professional edition and it is competitively priced."
report
Use our free recommendation engine to learn which Data Integration solutions are best for your needs.
816,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Manufacturing Company
15%
Computer Software Company
13%
Financial Services Firm
9%
Energy/Utilities Company
8%
Financial Services Firm
17%
Computer Software Company
13%
Manufacturing Company
8%
Insurance Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

How do you evaluate the ratio of price to quality for SAP Data Services?
I believe the license for the product is fair, even though some competitors offer similar services at lower prices. The difference between the Edge edition and the next one up is significant in pr...
Would you recommend SAP Data Services to complete beginners?
I think after some preparation, a beginner will be able to use SAP Data Services, however, if you are completely unprepared, you may have some issues. I think that in particular you will need a mo...
How has SAP Data Services helped your organization?
SAP Data Services has helped our whole staff understand the data across various sources and systems within the company. Not everyone who works at my organization is an IT expert - most just have av...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 

Also Known As

SAP BusinessObjects Data Services, SAP BusinessObjects Data Integrator, BusinessObjects Data Integrator
No data available
 

Learn More

Video not available
 

Overview

 

Sample Customers

EMC Corporation, LivePerson, Eldorado, Mozzart, The VELUX Group, AOK Bundesverband, Hilti Group, Nissha Printing Company Ltd., Asian Paints, Aareal Bank Group, Migros Group
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about SAP Data Services vs. StreamSets and other solutions. Updated: November 2024.
816,406 professionals have used our research since 2012.