Try our new research platform with insights from 80,000+ expert users

AWS Glue vs StreamSets comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

ROI

Sentiment score
5.9
Organizations find AWS Glue efficient and cost-effective despite overhead costs, though some consider alternatives due to budget constraints.
Sentiment score
8.1
StreamSets speeds up data processing, boosts efficiency and revenue, simplifies tasks, enhances security, and reduces costs significantly.
I advocate using Glue in such cases.
Data Architect at a financial services firm with 10,001+ employees
 

Customer Service

Sentiment score
6.5
AWS Glue customer service is praised for responsiveness and effectiveness, with mixed feedback on support speed, costs, and consistency.
Sentiment score
6.7
StreamSets support is responsive and knowledgeable, offering effective solutions, though response times and technical handling could improve.
AWS's documentation is reliable, and careful reference often resolves missed upgrade details.
Principal Consultant at a retailer with 1,001-5,000 employees
For complex Glue-related problems such as job failures or permission issues, their documentation is good, but having direct access to support helps cut down troubleshooting time significantly.
application security engineer at Hyperspace IT India
IBM technical support sometimes transfers tickets between different teams due to shift changes, which can be frustrating.
Enterprise Solutions Architect at a energy/utilities company with 1,001-5,000 employees
 

Scalability Issues

Sentiment score
7.8
AWS Glue is highly scalable and serverless, praised for easy resource management, but needs better parallel computation.
Sentiment score
7.6
StreamSets is scalable and flexible, favored for cloud use but could improve auto-scaling for large data migrations.
It can easily handle data from one terabyte to 100 terabytes or more, scaling nicely with larger datasets.
Python AWS & AI Expert at a tech consulting company
For jobs requiring multiple RAM usage, we increase the number of workers accordingly.
Principal Consultant at a retailer with 1,001-5,000 employees
 

Stability Issues

Sentiment score
7.9
AWS Glue is stable and reliable with minor issues, scaling well, and efficient due to serverless architecture and tool integration.
Sentiment score
7.8
StreamSets is praised for stability and reliability, despite minor memory issues, with high user ratings and market competitiveness.
As a managed service, it reduces management burdens.
Python AWS & AI Expert at a tech consulting company
 

Room For Improvement

AWS Glue faces challenges with startup times, interface complexity, language limitations, cost, performance, integration, and multi-cloud compatibility.
StreamSets struggles with integration, real-time processing, clarity in UI, memory issues, security, documentation, and cloud storage performance.
Learning the latest functionalities is crucial, and while challenging, it is a vital part of staying current and ensuring an efficient ETL process.
Principal Consultant at a retailer with 1,001-5,000 employees
With AWS, I gather data from multiple sources, clean it up, normalize it, de-duplicate it, and make it presentable.
Data Architect at a financial services firm with 10,001+ employees
A more user-friendly and simpler process would help speed up the deployment process.
Python AWS & AI Expert at a tech consulting company
It would be beneficial if StreamSets addressed any potential memory leak issues to prevent unnecessary upgrades.
Enterprise Solutions Architect at a energy/utilities company with 1,001-5,000 employees
 

Setup Cost

AWS Glue offers flexible, efficient serverless architecture but can be costly and unpredictable, especially for smaller organizations.
StreamSets provides flexible pricing models, with varied user satisfaction, favoring larger enterprises over smaller companies due to cost.
AWS charges based on runtime, which can be quite pricey.
Data Architect at a financial services firm with 10,001+ employees
The smallest cost for a project is around €700, while the largest can reach up to €7,000 based on the scale of the usage.
Python AWS & AI Expert at a tech consulting company
Regarding AWS Glue's pricing, it is not more expensive; rather, it is very reasonable, but it is not cheap.
application security engineer at Hyperspace IT India
 

Valuable Features

AWS Glue excels with its easy interface, scalable ETL processing, seamless AWS integration, affordability, and serverless architecture.
StreamSets offers intuitive interface, extensive connectors, and features accessible to non-technical users for seamless data integration and manipulation.
AWS Glue is very efficient and integrates well with the AWS ecosystem.
application security engineer at Hyperspace IT India
For ETL, I feel the performance is excellent. If I create jobs in a standard way, the performance is great, and maintenance is also seamless.
Principal Consultant at a retailer with 1,001-5,000 employees
AWS Glue also enhances job scheduling and orchestration capabilities, integrating with AWS Glue Studio for comprehensive data workflow management.
Python AWS & AI Expert at a tech consulting company
It allows a hybrid installation approach, rather than being completely cloud-based or on-premises.
Enterprise Solutions Architect at a energy/utilities company with 1,001-5,000 employees
 

Categories and Ranking

AWS Glue
Average Rating
7.8
Reviews Sentiment
6.9
Number of Reviews
50
Ranking in other categories
Cloud Data Integration (1st)
StreamSets
Average Rating
8.4
Reviews Sentiment
7.0
Number of Reviews
21
Ranking in other categories
Data Integration (22nd)
 

Featured Reviews

SC
application security engineer at Hyperspace IT India
Efficient data integration reduces operational time and enhances metadata management
For the initial setup with AWS Glue, I find it easy to set up the data catalog and create Glue jobs using the visual editor or the visual code. Setting permission sets via IAM rules can be a bit tricky at the start, but we ensure Glue has access to AWS S3, Redshift, and other services. Once the role is configured, it runs smoothly. For advanced configurations, connecting to VPCs and setting up connections with JDBC sources takes more time compared to my cloud experience, but overall, for someone with cloud and ETL experience, the setup is manageable and well done.
SS
Enterprise Solutions Architect at a energy/utilities company with 1,001-5,000 employees
Enables effective batch loading with visual interface and enterprise support
One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infrastructure. I had to switch to a new EC2 box, even though the processor was not fully utilized. It would be beneficial if StreamSets addressed any potential memory leak issues to prevent unnecessary upgrades. Additionally, it would be a great enhancement if StreamSets could produce a lineage graph to visualize how the data has passed through the system.
report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
879,889 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
21%
Computer Software Company
12%
Manufacturing Company
8%
Government
6%
Financial Services Firm
9%
Computer Software Company
8%
Insurance Company
8%
Manufacturing Company
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business11
Midsize Enterprise6
Large Enterprise32
By reviewers
Company SizeCount
Small Business9
Midsize Enterprise2
Large Enterprise11
 

Questions from the Community

How do you select the right cloud ETL tool?
AWS Glue and Azure Data factory for ELT best performance cloud services.
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What are the most common use cases for AWS Glue?
AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or ma...
What do you like most about StreamSets?
The best thing about StreamSets is its plugins, which are very useful and work well with almost every data source. It's also easy to use, especially if you're comfortable with SQL. You can customiz...
What needs improvement with StreamSets?
One issue I observed with StreamSets is that the memory runs out quickly when processing large volumes of data. Because of this memory issue, we have to upgrade our EC2 boxes in the Amazon AWS infr...
What is your primary use case for StreamSets?
We are using StreamSets for batch loading.
 

Overview

 

Sample Customers

bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
Availity, BT Group, Humana, Deluxe, GSK, RingCentral, IBM, Shell, SamTrans, State of Ohio, TalentFulfilled, TechBridge
Find out what your peers are saying about AWS Glue vs. StreamSets and other solutions. Updated: December 2025.
879,889 professionals have used our research since 2012.