AWS Glue vs Talend Open Studio comparison

 

Comparison Buyer's Guide

Executive Summary
Updated on Apr 20, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

AWS Glue
Average Rating: 7.8
Reviews Sentiment: 7.0
Number of Reviews: 49
Ranking in other categories: Cloud Data Integration (1st)

Talend Open Studio
Average Rating: 8.0
Reviews Sentiment: 6.8
Number of Reviews: 50
Ranking in other categories: Data Integration (5th)
 

 

Featured Reviews

Saurabh Jaiswal - PeerSpot reviewer
Enables seamless integration and data preparation with robust transformation capabilities
AWS Glue's most valuable features include its transformation capabilities, which provide data quality and shape for processing in ML or AI models. It offers transformation options on canvas or through ETL pipelines, notebooks, and code. Additionally, it supports data preparation, cleaning, and filtering seamlessly. AWS Glue also enhances job scheduling and orchestration capabilities, integrating with AWS Glue Studio for comprehensive data workflow management.
Costin Marzea - PeerSpot reviewer
Allows you to develop your own components and can be used as an OEM
Sometimes, scalability is part of planning. It depends on what you mean by scalability. People talk a lot about it, but scalability is not always about system functionality. Sometimes, it is about planning the job you're doing. If you want to split it into several jobs or servers, you don't actually need that built in as functionality. You can create a job that uses a loop to run and control several other jobs. Scaling should not always be left to the infrastructure, based on whether the engine can scale or not. I think it's your plan or project that should scale and split, and you can define the parameters for that. These parameters include how many servers you want to run on or how many executions you want to do on different parts of the data. It's not always an issue of the engine running. Sometimes, your database should be configured to support partitioning. The product may scale very well without partitioning, but if the basic response is very slow, you haven't solved the problem. You should solve the problems at a higher level, not just at the execution level. They should be solved at the database level and communication level, and you should have firewalls. We are trying to add to the open-source version the ability to generate code for containers and Kubernetes, which exists in the subscription version. Once you do this, Kubernetes will take care of the scaling, so there is no problem.
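
The approach this reviewer describes, a parent job that loops over parts of the data and runs one child job per part, can be illustrated with a minimal sketch. This is plain Python, not Talend's job designer or API; the names `split_into_chunks`, `run_job`, and `run_partitioned` are hypothetical, and `executions` and `workers` stand in for the "how many executions" and "how many servers" parameters mentioned in the review.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(rows, executions):
    """Split rows into `executions` contiguous chunks of near-equal size."""
    size, extra = divmod(len(rows), executions)
    chunks, start = [], 0
    for i in range(executions):
        # The first `extra` chunks take one additional row each.
        end = start + size + (1 if i < extra else 0)
        chunks.append(rows[start:end])
        start = end
    return chunks


def run_job(chunk):
    """Hypothetical child job; here it only sums the rows it receives."""
    return sum(chunk)


def run_partitioned(rows, executions, workers):
    """Parent job: run one child job per chunk on a pool of `workers`."""
    chunks = split_into_chunks(rows, executions)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in the same order as the chunks.
        return list(pool.map(run_job, chunks))


results = run_partitioned(list(range(1, 101)), executions=4, workers=2)
print(results)
```

In this sketch the scaling decision lives in the two parameters rather than in the engine, which is the point the reviewer makes; a real project would replace `run_job` with whatever launches an actual ETL job.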

Quotes from Members

 

Pros

"AWS Glue is very quick to start without cold starts, unlike AWS Lambda."
"AWS Glue is fast and managed by AWS. Hence, you don't have to worry about capacity and the performance of Glue jobs. It has integrations with other data stores of AWS. The product offers metadata management, logging, and ETL processing capabilities. It comes with a powerful feature, Glue Studio, which helps to do queries interactively within the community. It is a managed service and very secure. Another popular and mature service is S3."
"AWS Glue's most valuable features are the data catalog, including crawlers and tables, and Glue Studio, which means you don't have to use custom code."
"The product has a valuable feature for data catalog."
"The most valuable feature of AWS Glue is scalability."
"I also like that you can add custom libraries like JAR files and use them. So, the ability to use a fast processing engine and embed basic jobs easily are significant advantages."
"The best thing about AWS Glue is its scalability and how easy it is to process a large amount of data."
"One aspect that I would like to highlight is the Glue Crawler, which we utilize when working with large datasets to ensure the schema updates seamlessly without requiring end-team knowledge."
"I can connect with different databases such as Oracle Database or SQL Server. It allows you to extract the data from one database to another. I can structure the data by filtering and mapping the fields. It is very user-friendly. You need to know the basics of SQL development or SQL queries, and you can use this tool."
"The initial setup of the product was very easy."
"This product is very easy to use."
"The solution has a good balance between automated items and the ability for a developer to integrate and extend what he needs. Other competing tools do not offer the same grade of flexibility when you need to go beyond what is provided by the tool. Talend, on the other hand, allows you to expand very easily."
"Talend lets you do everything — mapping, workflow, and orchestration — in a single place."
"The most valuable features are the ETL tools."
"The standout feature for me is the user-friendly nature of the components."
"You can use Talend as a stand-alone application without customization to collect data and generate reports over dashboards. It's got great functionality."
 

Cons

"It would be better if it were more user-friendly. The interesting thing we found is that it was a little strange at the beginning. The way Glue works is not very straightforward. After trying different things, for example, we used just the console to create jobs. Then we realized that things were not working as expected. After researching and learning more, we realized that even though the console creates the script for the ETL processes, you need to modify or write your own script in Spark to do everything you want it to do. For example, we are pulling data from our source database and our application database, which is in Aurora. From there, we are doing the ETL to transform the data and write the results into Redshift. But what was surprising is that it's almost like whatever you want to do, you can do it with Glue because you have the option to put together your own script. Even though there are many functionalities and many connections, you have the opportunity to write your own queries to do whatever transformations you need to do. It's a little deceiving that some options are supposed to work in a certain way when you set them up in the console, but then they are not exactly working the right way or not as expected. It would be better if they provided more examples and more documentation on options."
"On occasion, the solution's dashboard reports that a project failed due to runtime but it actually succeeded."
"I am quite technical, yet when it comes to data pipelines, I prefer using GUIs over code-based calls. It might be easier with a different skill set."
"The interface for AWS Glue could improve; it does not show a lot of detail. You can write the code in PySpark or Scala, which is a big advantage, but it is only easy to use for a developer. It will be difficult for new users to enter the cloud environment."
"Only people who can code, either in Java or Python, can use the product freely. Those who don't know Java or Python might find using AWS Glue difficult."
"In the building and deployment aspects, there is room for improvement. The current process is a bit complicated and could benefit from being more user-friendly and simpler, which would help speed up the deployment process."
"The start-up time is really high right now. For instance, when you start up a new job, you have to wait for five or eight minutes before it starts. If the start-up time is reduced to one or two minutes, it will be great. It will be better to have a direct linkage to Redshift in AWS. If we can use data catalogs from Redshift, it will be so easy to create some data catalogs. Currently, we can only use data catalogs from S3."
"In terms of improvement, the performance of AWS Glue could be faster."
"The security features could be improved."
"Inter-version compatibility is a problem, and migration projects between versions are required."
"I rate Talend Open Studio's stability an eight out of ten. Talend has some problems sometimes."
"The user interface could be made simpler."
"It needs better installation configuration for other databases. Although the installation allows you to select another database, this doesn't mean that all connection points in the application point to the database selected. You actually need to do a search through the entire install to locate the configuration settings and change them."
"Talend should improve the log and error handling to better track the errors you find during development. Sometimes it's challenging to see what's causing an issue, and tracking that on Talend is complicated."
"We need more components to be more efficient. We use a lot of components, such as Salesforce, and it's not easy to use. There are minor bugs and it's not easy to use some of the features."
"I think my biggest problem with the tool is that the errors are very hard to debug."
 

Pricing and Cost Advice

"The pricing is a bit higher than other solutions like Athena and EC2. If the pricing becomes more scaled or flexible, it will be good because you have to pay 44 cents just for one DPU for an hour. If you increase DPUs to 5 or 10, the pricing gets multiplied. There are also some time limits like 0 to 10 minutes or 10 to 20 minutes. If the pricing is according to the minutes, it would be better because you have to limit your job to 10 minutes or 20 minutes."
"It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us."
"AWS Glue is quite costly, especially for small organizations."
"Technical support is a paid service, and the support you get depends on which subscription you have. You must pay for one of them, and the price ranges from $15,000 to $25,000 per year."
"If you are using the solution for an enterprise business, it will be expensive."
"I would rate the solution a six or seven on a scale of one to ten, with ten being very expensive. Specifically, I rate its pricing a six out of ten."
"I rate the tool an eight on a scale of one to ten, where one is cheap, and ten is expensive."
"Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
"It is an open-source product."
"Talend Open Studio is priced too high."
"There are many versions available and one is open-sourced which is free."
"It does the job well for nothing — without cost. That's the advantage of this product."
"Talend is free and you can download it."
"Talend Open Studio costs about 11,000 a year."
"Pricing and licensing are fairly straightforward. It is reasonably priced and managed."
"I am using the open-source version and it is free."
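
The AWS Glue pricing arithmetic in the first quote above (44 cents per DPU per hour, multiplied by the number of DPUs, with runtime falling into 10-minute brackets) can be worked through with a small estimator. This is a rough sketch based only on the reviewer's figures: the rate and the 10-minute rounding are assumptions taken from that quote and may not match current AWS pricing, and `estimate_job_cost` is a hypothetical helper, not an AWS API.

```python
import math

# USD per DPU-hour, as quoted by the reviewer; an assumption, not current pricing.
RATE_PER_DPU_HOUR = 0.44


def estimate_job_cost(dpus, runtime_minutes, block_minutes=10, rate=RATE_PER_DPU_HOUR):
    """Estimate the cost of one job run, rounding runtime up to whole blocks."""
    # A run of 12 minutes is billed as two 10-minute blocks under this assumption.
    blocks = max(1, math.ceil(runtime_minutes / block_minutes))
    billed_hours = blocks * block_minutes / 60
    return round(dpus * billed_hours * rate, 2)


print(estimate_job_cost(dpus=1, runtime_minutes=60))
print(estimate_job_cost(dpus=10, runtime_minutes=60))
print(estimate_job_cost(dpus=5, runtime_minutes=12))
```

Under these assumptions, a 12-minute run and a 20-minute run on the same number of DPUs cost the same, which is the behavior the reviewer would like replaced with per-minute billing.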
861,524 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
AWS Glue
Financial Services Firm: 21%
Computer Software Company: 13%
Manufacturing Company: 8%
Government: 6%

Talend Open Studio
Financial Services Firm: 17%
Computer Software Company: 13%
Manufacturing Company: 8%
Government: 7%
 

Company Size

By reviewers: Large Enterprise, Midsize Enterprise, Small Business (chart values not captured)
 

Questions from the Community

How do you select the right cloud ETL tool?
For ELT, AWS Glue and Azure Data Factory are the best-performing cloud services.
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What are the most common use cases for AWS Glue?
AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or ma...
What do you like most about Talend Open Studio?
It is easy to use and covers most of the functions needed. We can use the code without any extra effort. The open source is very good. They have the same commercials with additional connectors. The...
What needs improvement with Talend Open Studio?
There is a need for mastery in some areas.
 

Also Known As

AWS Glue: No data available
Talend Open Studio: Open Studio
 


Sample Customers

AWS Glue: bp, Cerner, Expedia, Finra, HESS, Intuit, Kellogg's, Philips, TIME, Workday
Talend Open Studio: Almerys, BF&M, Findus
Find out what your peers are saying about AWS Glue vs. Talend Open Studio and other solutions. Updated: July 2025.