
StreamSets vs WhereScape RED comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

StreamSets
Ranking in Data Integration: 9th
Average Rating: 8.4
Number of Reviews: 24
Ranking in other categories: None

WhereScape RED
Ranking in Data Integration: 49th
Average Rating: 8.2
Number of Reviews: 15
Ranking in other categories: None
 

Mindshare comparison

As of November 2024, in the Data Integration category, the mindshare of StreamSets is 1.7%, up from 1.3% a year earlier. The mindshare of WhereScape RED is 1.0%, down from 1.1% a year earlier. Mindshare is calculated from PeerSpot user engagement data.
 

Featured Reviews

Reyansh Kumar - PeerSpot reviewer
Mar 10, 2023
We no longer need to hire highly skilled data engineers to create and monitor data pipelines
The things I like about StreamSets are its overall user interface, efficiency, and product features, which are all good. Also, the scheduling within the data engineering pipeline is very much appreciated, and it has a wide range of connectors for connecting to any data sources like SQL Server, AWS, Azure, etc. We have used it with Kafka, Hadoop, and Azure Data Factory Datasets. Connecting to these systems with StreamSets is very easy. You just need to configure the data sources, the paths and their configurations, and you are ready to go. It is very efficient and very easy to use for ETL pipelines. It is a GUI-based interface in which you can easily create or design your own data pipelines with just a few clicks. As for moving data into modern analytics systems, we are using it with Microsoft Power BI, AWS, and some on-premises solutions, and it is very easy to get data from StreamSets into them. No hardcore coding or special technical expertise is required. It is also a no-code platform in which you can configure your data sources and data output for easy configuration of your data pipeline. This is a very important aspect because if a tool requires code development, we need to hire software developers to get the task done. By using StreamSets, it can be done with a few clicks.
SM
Jul 9, 2021
Quick to set up, flexible, and stable
I don't like the scheduling part. It allows you to schedule jobs as parent and child, among other things, but error traceability needs to be more user-friendly. It's also not user-friendly in the sense that it loads all the jobs, and there are not enough filters to avoid loading everything. If a job fails, you don't get any type of alert or email. It would be ideal if there was some sort of automated alert message. Technical support isn't the best. It would be ideal if we understood how to do it in a card exception regarding exclusion, where the card is captured separately rather than filling the whole process on the data inbound side. Certain workloads like this are organized in such a way that you seem to be doubling the work as opposed to streamlining the process.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The ETL capabilities are very useful for us. We extract and transform data from multiple data sources, into a single, consistent data store, and then we put it in our systems. We typically use it to connect our Apache Kafka with data lakes. That process is smooth and saves us a lot of time in our production systems."
"The ease of configuration for pipes is amazing. It has a lot of connectors. Mainly, we can do everything with the data in the pipe. I really like the graphical interface too."
"The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customize it to do what you need. Many other tools have started to use features similar to those introduced by StreamSets, like automated workflows that are easy to set up."
"It is really easy to set up and the interface is easy to use."
"Also, the intuitive canvas for designing all the streams in the pipeline, along with the simplicity of the entire product are very big pluses for me. The software is very simple and straightforward. That is something that is needed right now."
"Important features include that it comprises lots of functionality to connect data from various sources through connector availability, scheduling pipelines at any time, and integration with third-party and security solutions for encryption."
"The ability to have a good bifurcation rate and fewer mistakes is valuable."
"The most valuable features are the option of integration with a variety of protocols, languages, and origins."
"RED generates comprehensive documentation and regenerates it as quickly as things change, but it also provides impact documentation."
"WhereScape's deployment package is a fantastic feature. The application allows for selecting specific objects that you would like to deploy from one environment to another rather than deploying the entire database."
"Their support staff are very knowledgeable, courteous, and professional. I feel their support staff go above and beyond to assure their customers are satisfied."
"It has a built-in automatic scheduling environment."
"Quickly develops a data warehouse for our organization with documentation and can track back/forward features."
"RED has provided us the ability to integrate, stage, and transform data from diverse sources into an enterprise-grade data warehouse which meets the needs of my organization, but it also enables us to easily and quickly make ETL or DW changes."
"The tool supports multiple target update methods."
"Support is absolutely excellent, efficient, and timely."
 

Cons

"I would like to see further improvement in the UI. In addition, upgrades are not automatic and they should be automated. Currently, we have to manually upgrade versions."
"One thing that I would like to add is the ability to manually enter data. The way the solution currently works is we don't have the option to manually change the data at any point in time. Being able to do that will allow us to do everything that we want to do with our data. Sometimes, we need to manually manipulate the data to make it more accurate in case our prior bifurcation filters are not good. If we have the option to manually enter the data or make the exact iterations on the data set, that would be a good thing."
"We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which was painful. Also, pipeline failures were common, and data drifting wasn't addressed, which made things worse. Licensing was another issue we encountered."
"We create pipelines or jobs in StreamSets Control Hub. It is a great feature, but if there is a way to have a folder structure or organize the pipelines and jobs in Control Hub, it would be great. I submitted a ticket for this some time back."
"We've seen a couple of cases where it appears to have a memory leak or a similar problem."
"The data collector in StreamSets has to be designed properly. For example, a simple database configuration with MySQL DB requires the MySQL Connector to be installed."
"Visualization and monitoring need to be improved and refined."
"Using ETL pipelines is a bit complicated and requires some technical aid."
"Project-based searching of data objects in the data warehouse browser needs to be improved."
"The solution could be a little more user-friendly at the enterprise level, where people use it."
"Customization could be better."
"Jobs cannot be deleted via the deployment package. When deploying from dev to QA or production, a job has to be retired. The job has to be manually removed from the target environment."
"No support for change data capture or delta detection; that must be custom coded."
"It could use a tool to diagnose what is missing from the environment for WhereScape to install successfully."
"Improve the object renaming ability (it works, but it could be more automated)."
"The ability to execute SSIS projects within WhereScape would be nice because we have a lot of packages that are too cumbersome to recreate."
 

Pricing and Cost Advice

"It has a CPU core-based licensing, which works for us and is quite good."
"The overall cost for small and mid-size organizations needs to be better."
"The pricing is affordable for any business."
"There are two editions, Professional and Enterprise, and there is a free trial. We're using the Professional edition and it is competitively priced."
"We use the free version. It's great for a public, free release. Our stance is that the paid support model is too expensive to get into. They should honestly reevaluate that."
"There are different versions of the product. One is the corporate license version, and the other one is the open-source or free version. I have been using the corporate license version, but they have recently launched a new open-source version so that anybody can create an account and use it. The licensing cost varies from customer to customer. I don't have a lot of input on that. It is taken care of by PMO, and they seem fine with its pricing model. It is being used enterprise-wide. They seem to have got a good deal for StreamSets."
"I believe the pricing is not equitable."
"Its pricing is pretty much up to the mark. For smaller enterprises, it could be a big price to pay at the initial stage of operations, but the moment you have the Seed B or Seed C funding and you want to scale up your operations and aren't much worried about the funds, at that point in time, you would need a solution that could be scaled."
"Speed to market of a warehouse solution at a relatively inexpensive price point."
"Our company purchased a corporate unlimited license."
"Factor in the price of specialized consulting who know this product. They're hard to find and expensive."
"ROI is at least 10 times."
814,649 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 17%
Computer Software Company: 13%
Manufacturing Company: 9%
Government: 6%

Financial Services Firm: 17%
Government: 10%
Insurance Company: 9%
Manufacturing Company: 9%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
We often faced problems, especially with SAP ERP. We struggled because many columns weren't integers or primary keys, which StreamSets couldn't handle. We had to restructure our data tables, which ...
What is your primary use case for StreamSets?
StreamSets is used for data transformation rather than ETL processes. It focuses on transforming data directly from sources without handling the extraction part of the process. The transformed data...
 


Sample Customers

Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
British American Tobacco, Cornell University, Allianz Benelux, Finnair, Solarwinds and many more.
Find out what your peers are saying about StreamSets vs. WhereScape RED and other solutions. Updated: October 2024.