Try our new research platform with insights from 80,000+ expert users
Senthil Kumar Veerasamy - PeerSpot reviewer
Senior Manager, Analytics at Azendian
Real User
Top 10
A highly scalable solution, but its visual ETL tool is of no use for actual implementation
Pros and Cons
  • "The most valuable feature of AWS Glue is scalability."
  • "The solution's visual ETL tool is of no use for actual implementation."

What is our primary use case?

We are implementing a solution in AWS for one of our customers. It is more of a data analytics solution. We wanted to process data from different sources and put it into a central repository that can be used for any analysis or predictive modeling.

What is most valuable?

The most valuable feature of AWS Glue is scalability.

What needs improvement?

Since AWS Glue is not like an enterprise ETL tool, we need to put quite a lot of effort into customization. The solution has a visual editor, but most ETL transformations cannot be implemented or constructed using that. We always have to do a script. The solution's visual ETL tool is of no use for actual implementation.

For how long have I used the solution?

I have been using AWS Glue for two years.

Buyer's Guide
AWS Glue
September 2025
Learn what your peers think about AWS Glue. Get advice and tips from experienced pros sharing their opinions. Updated: September 2025.
867,676 professionals have used our research since 2012.

What do I think about the stability of the solution?

I rate AWS Glue an eight out of ten for stability.

What do I think about the scalability of the solution?

Most of our clients for AWS Glue are enterprise businesses.

I rate AWS Glue ten out of ten for scalability.

How are customer service and support?

Nobody from AWS technical support has implementation experience.

How would you rate customer service and support?

Neutral

How was the initial setup?

On a scale from one to ten, where one is difficult and ten is easy, I rate the solution's configuration a five and its implementation a two or three out of ten.

What's my experience with pricing, setup cost, and licensing?

The solution's pricing is pay-as-you-go. If you are using the solution for an enterprise business, it will be expensive.

What other advice do I have?

AWS Glue is a cloud-based solution.

Overall, I rate AWS Glue a six out of ten.

Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
PeerSpot user
ShilpaShivapuram - PeerSpot reviewer
Principal Data Architect at Wells Fargo
Real User
Scalable and lightweight option for migrating workloads
Pros and Cons
  • "AWS Glue's best features are scalability and cloud-based features."
  • "AWS Glue would be improved by making it easier to switch from single to multi-cloud."

What is our primary use case?

I primarily use AWS Glue as a lightweight ETL to migrate our existing on-prem workloads to a cloud environment without looking at a lot of migration paths. 

How has it helped my organization?

AWS Glue served the purpose of migrating our on-prem workloads to a cloud environment without involving a heavy load. It ensured that we were able to test every migrated component independently.

What is most valuable?

AWS Glue's best features are scalability and cloud-based features.

What needs improvement?

AWS Glue would be improved by making it easier to switch from single to multi-cloud.

For how long have I used the solution?

I've been using AWS Glue for eighteen months.

What do I think about the scalability of the solution?

I would rate AWS Glue's scalability eight out of ten, as it still has some room to improve.

How was the initial setup?

AWS Glue is straightforward to implement, and you won't see any technical complexity if you're from a development background.

Which other solutions did I evaluate?

We also considered Airflow, but Glue was a better fit for our engineering-heavy workloads because we wanted a serverless option.

What other advice do I have?

I would rate AWS Glue eight out of ten.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: My company has a business relationship with this vendor other than being a customer. partner
PeerSpot user
Buyer's Guide
AWS Glue
September 2025
Learn what your peers think about AWS Glue. Get advice and tips from experienced pros sharing their opinions. Updated: September 2025.
867,676 professionals have used our research since 2012.
Sunil Morya - PeerSpot reviewer
Consultant at a tech vendor with 10,001+ employees
Real User
Easy to set up, useful for batch processing, and is free to try
Pros and Cons
  • "The solution helps organizations gain flexibility in defining the structure of the data."
  • "I haven't looked into Glue in terms of seeking out flaws. I've not come across missing features."

What is our primary use case?

Once you get the data and you don't know about the structure of the data, then Glue is very helpful to estimate the structure, including where is the structure, and it'll identify everything for you. It has one component that is called Glue Crawler that is quite useful for this task. It will go through segments of your data and try to guess their structure. It pops out the structure, and you can modify it according to your convenience.

It is good to basically perform the ETL when your files are stored in the S3 bucket. Glue supports other external sources also. That said, most of the time, we have basically given our proposal to clients if the data is available in S3.

How has it helped my organization?

The solution helps organizations gain flexibility in defining the structure of the data.

You can define and then include the original data structure and decide what the required fills are or what other ones you can omit. You can perform certain processing tasks also, and you can basically apply the multiplying factor; you can do the cleanup, et cetera, on the fly with the Glue.

What is most valuable?

The Glue Crawler can have a set of connectors, so you can utilize those connectors to connect with the external databases, which may be on-premise in different networks or maybe locally on AWS. Basically, you can use the connector to fetch the data. 

Once you have a data schema, you can start streaming or fetching the data in the particular format conversion. For example, suppose you have the text file, and you have Word in place or maybe in SQL, and you can use the connector on the fly to convert the database.

For batch processing, batch genetics, it is helpful for the ETL process.

The setup is easy. 

The solution offers a free trial. 

The solution can scale. 

It's stable.

Users only pay for what they use once they have a license. 

What needs improvement?

I haven't looked into Glue in terms of seeking out flaws. I've not come across missing features. 

For how long have I used the solution?

I've been dealing with the solution for two or three years. I have given a lot of proposals based on customer demand.

What do I think about the stability of the solution?

The solution is quite stable and reliable. There are no bugs or glitches. It doesn't crash or freeze. It is reliable. 

What do I think about the scalability of the solution?

Typically, data analytics individuals use the solution. It's not for an entire organization.

It's a scalable solution. I'd rate it ten out of ten. 

We do have plans to increase usage. We are in the process of moving many things to the cloud, and if they move onto AWS, they'll need Glue.

How are customer service and support?

I've never been in touch with technical support. I can't speak to how helpful or responsive they are. 

How was the initial setup?

The solution is very straightforward to set up and implement. 

I'd rate the ease of deployment at an eight or nine out of ten. However, it all depends on the circumstances. 

The deployment only takes two to three minutes. It's very fast.

Using the console, you have different sections of AWS Glue You can go and specify the input data source and output target data place. Then you need to specify the transformation. If you want to do the filtering, et cetera, you have to specify. You have the blueprint of transformation functions available also, and you can select from there and then just run it. 

What about the implementation team?

I've only just explored the solution. It has not been deployed yet. 

What's my experience with pricing, setup cost, and licensing?

When you are just learning and testing the solution, it is free. I cannot speak to the full cost beyond that, as I am just experimenting with the product. They do offer it to users for a limited time to try for free, however.

My understanding is you only pay for what you use, so pricing would vary based on that. You don't need to maintain a cluster and it is serverless. 

There are no extra costs beyond a standard license fee.

What other advice do I have?

We are using the latest version of the solution. The solution runs on the cloud and is serverless. 

It's a good solution to use when people are not exporting analytics. If you want to perform some ETL on your data and the data is complex, then you should go for Glue. It is easy to set up.

I'd rate the solution ten out of ten. 

Which deployment model are you using for this solution?

Public Cloud
Disclosure: My company has a business relationship with this vendor other than being a customer.
PeerSpot user
Sainagaraju Vaduka - PeerSpot reviewer
Data solution architect at a pharma/biotech company with 5,001-10,000 employees
Real User
Excellent scalability, with valuable features, and profitable return on investment
Pros and Cons
  • "The most valuable features currently are glue studio, jobs, and triggers."
  • "I would like to see stable libraries at the moment they are not there."

What is our primary use case?

We are primarily using it for batch crossing and transformations.

How has it helped my organization?

We have a large set of data and we are doing some transformations and identification. We are cleaning the data and transformations. Then we are putting the data into the destination table. So it is very comfortable.

What is most valuable?

The most valuable features currently are glue studio, jobs, and triggers.

What needs improvement?

I would like to see stable libraries at the moment they are not there.

For how long have I used the solution?

I have been using AWS Glue for the past five years.

What do I think about the stability of the solution?

The stability I would consider to be an extensible Apache Spark.

What do I think about the scalability of the solution?

The scalability is good and we have three hundred projects we are working with.

Which solution did I use previously and why did I switch?

Previously, we used EMR, Informatica, Data Pipeline, and Azure Data Factory.

How was the initial setup?

The initial setup is straightforward.

What about the implementation team?

We did our deployment in-house with the CI/CD integrations like GitHub and deployed the code on Glue. 

What was our ROI?

We are seeing a very good return on our investment.

What's my experience with pricing, setup cost, and licensing?

The current cost is around forty to fifty thousand a month.

What other advice do I have?

I would definitely recommend using AWS Glue for batching procedures. I would rate AWS Glue an eight out of ten.

Which deployment model are you using for this solution?

On-premises
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Jorge Encinas - PeerSpot reviewer
Sr. Data Engineer at a tech services company with 5,001-10,000 employees
MSP
An event-driven, serverless computing platform that is flexible, powerful, and customizable
Pros and Cons
  • "I like that it's flexible, powerful, and allows you to write your own queries and scripts to get the needed transformations."
  • "It would be better if it were more user-friendly. The interesting thing we found is that it was a little strange at the beginning. The way Glue works is not very straightforward. After trying different things, for example, we used just the console to create jobs. Then we realized that things were not working as expected. After researching and learning more, we realized that even though the console creates the script for the ETL processes, you need to modify or write your own script in Spark to do everything you want it to do. For example, we are pulling data from our source database and our application database, which is in Aurora. From there, we are doing the ETL to transform the data and write the results into Redshift. But what was surprising is that it's almost like whatever you want to do, you can do it with Glue because you have the option to put together your own script. Even though there are many functionalities and many connections, you have the opportunity to write your own queries to do whatever transformations you need to do. It's a little deceiving that some options are supposed to work in a certain way when you set them up in the console, but then they are not exactly working the right way or not as expected. It would be better if they provided more examples and more documentation on options."

What is our primary use case?

We used AWS Glue to build our data warehouse. We built prototypes to go all the way all across their warehouse platforms. From AWS Glue to Spreadsheets and then QuickSight, that's how we're building their warehouse.

What is most valuable?

I like that it's flexible, powerful, and allows you to write your own queries and scripts to get the needed transformations.

What needs improvement?

It would be better if it were more user-friendly. The interesting thing we found is that it was a little strange at the beginning. The way Glue works is not very straightforward. After trying different things, for example, we used just the console to create jobs. Then we realized that things were not working as expected. After researching and learning more, we realized that even though the console creates the script for the ETL processes, you need to modify or write your own script in Spark to do everything you want it to do.

For example, we are pulling data from our source database and our application database, which is in Aurora. From there, we are doing the ETL to transform the data and write the results into Redshift. But what was surprising is that it's almost like whatever you want to do, you can do it with Glue because you have the option to put together your own script. Even though there are many functionalities and many connections, you have the opportunity to write your own queries to do whatever transformations you need to do.

It's a little deceiving that some options are supposed to work in a certain way when you set them up in the console, but then they are not exactly working the right way or not as expected. It would be better if they provided more examples and more documentation on options.

For how long have I used the solution?

I have been using AWS Glue since last year.

What other advice do I have?

On a scale from one to ten, I would give AWS Glue a nine.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Diego Henrique Da SilvaBastos - PeerSpot reviewer
Data Engineer at a tech services company with 501-1,000 employees
Real User
Top 20
Offers good documentation, stability but error handling is difficult
Pros and Cons
  • "It's very good to manage."
  • "AWS Glue's error handling is difficult."

What is our primary use case?

I use AWS Glue for data processing. Some of my colleagues have data for software, and I use AWS Glue to transform and inspect this data.

What is most valuable?

It's very good to manage. It is easy to integrate other products with AWS. 

Glue integrates with other AWS processes and networks. So, it's quite easy to integrate.

I've worked with AI integration but I haven't gone into much depth on that topic.

What needs improvement?

AWS Glue's error handling is difficult. 

The errors in AWS are very hard to handle. The screen is very hard to understand. 

I have to use CloudWatch, but whatever our error was, the new ones, and so on. I would test this with someone. It's not so easy for me, and there are more things related to this.

For how long have I used the solution?

I have been using it for a year and a half. 

What do I think about the stability of the solution?

I would rate the stability a nine out of ten. 

What do I think about the scalability of the solution?

I would rate the scalability a seven out of ten. 

My data is small, so we need to consider more days. We need to deal with what we have, but I understand the documentation. 

Some people find it hard, but I rated it a seven. In my company, TechOps uses AWS with about 1,200 users.

Which solution did I use previously and why did I switch?

I worked with Databricks. In my opinion, Databricks is improving and is easier to use. It's more user-friendly, and I think it's better overall.

How was the initial setup?

I work with a big company, and most of it is already quickly done, like using something that is a blueprint. This configuration stuff is already working in another place. The only thing I have to do with the cloud is the remote configuration.

What's my experience with pricing, setup cost, and licensing?

AWS can be expensive.

What other advice do I have?

Overall, I would rate it a seven out of ten. I would recommend it.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Mbaye Babacar Gueye - PeerSpot reviewer
Owner at a tech services company with 51-200 employees
Real User
Top 10
Capable of handling real-time but ETL interface could be more user-friendly
Pros and Cons
  • "I also like that you can add custom libraries like JAR files and use them. So, the ability to use a fast processing engine and embed basic jobs easily are significant advantages."
  • "One area that could be improved is the ETL view. The drag-and-drop interface is not as user-friendly as some other ETL tools."

What is our primary use case?

One common use case is migrating data from one system to another.  So, mostly migrating data and data engineering, getting real-time or near-real-time data using Lambda functions and migrating big data from on-prem to the cloud for historical data before starting a project.

What is most valuable?

If you have the Fund Manager, you could use a fast processing engine, which is crucial for performance. 

I also like that you can add custom libraries like JAR files and use them. So, the ability to use a fast processing engine and embed basic jobs easily are significant advantages.

What needs improvement?

One area that could be improved is the ETL view. The drag-and-drop interface is not as user-friendly as some other ETL tools. 

Additionally, AWS Glue can sometimes be slow, especially when processing large datasets. It was sometimes a bit slow. Also, I couldn't directly use bucketed data. With Elastic Glue, you had to convert your data frames into the correct format before connecting them using the drag-and-drop interface. So that's something I didn't like because the conversion process wasn't straightforward. 

In future releases, I would like to see a feature that could trigger Glue pipeline using an API or something. 

For how long have I used the solution?

I have experience with AWS Glue. I have about one year of experience in a professional setting, but I have also done some personal work with this solution.

How are customer service and support?

Support was good, but I was working with a big client, so that might have influenced the experience. The response time was fast, we heard back from them within a day. 

How would you rate customer service and support?

Positive

How was the initial setup?

I would rate my experience with the initial setup an eight out of ten, where one is difficult and ten is easy. 

The initial setup is not very complex. You can customize parameters like minimum and maximum for your needs. For me, it wasn't complex to deploy the solution. 

What other advice do I have?

I'd rate it around six out of ten compared to other tools like Databricks.  

Which deployment model are you using for this solution?

On-premises
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Shifa Shah - PeerSpot reviewer
Data engineer at nust
Real User
Better than other tools for ETL jobs, but needs better documentation
Pros and Cons
  • "AWS Glue is quite better than other tools, but you have to learn it properly before you start using it."
  • "While working on AWS Glue, I could not find any training material for it."

What is our primary use case?

I constructed a straightforward ETL job using AWS Glue, wherein I had to load a couple of files in the Teradata database.

What is most valuable?

AWS Glue is quite better than other tools, but you have to learn it properly before you start using it.

What needs improvement?

While working on AWS Glue, I could not find any training material for it. Although it's not a problem with the product, the solution could include better documentation.

For how long have I used the solution?

I have been using AWS Glue for about two months.

What do I think about the stability of the solution?

AWS Glue is a stable solution.

How was the initial setup?

AWS Glue's initial setup is quite straightforward.

What other advice do I have?

Overall, I rate AWS Glue a seven out of ten.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Download our free AWS Glue Report and get advice and tips from experienced pros sharing their opinions.
Updated: September 2025
Product Categories
Cloud Data Integration
Buyer's Guide
Download our free AWS Glue Report and get advice and tips from experienced pros sharing their opinions.