Try our new research platform with insights from 80,000+ expert users
reviewer1994781 - PeerSpot reviewer
Consultant - Business Operations at a computer software company with 10,001+ employees
Real User
Transformations are valuable for modifying complex data but rely too heavily on code
Pros and Cons
  • "Transformations are valuable because you can modify or override complex data logic from an open source or Spark to solve issues."
  • "The setup and installation is a bit complex without advanced knowledge or training."

What is our primary use case?

Our company uses the solution for ETL data movement for our customers such as on-premises to cloud, cloud to cloud, and cloud to Snowflake. We also data catalog and schedule ETL jobs. We are able to monitor all jobs through AWS services. 

What is most valuable?

Transformations are valuable because you can modify or override complex data logic from an open source or Spark to solve issues. 

For example, it is easy to solve issues where volume is good but performance is degrading because you can split jobs into small chunks to more quickly handle data loads. 

What needs improvement?

The setup and installation is a bit complex without advanced knowledge or training. It would be easier for an AWS expert or someone in DevOps.

Transformations need improvements to be more user friendly and rely less on coding like Matillion. 

For how long have I used the solution?

I have been using the solution for three years. 

Buyer's Guide
AWS Glue
October 2025
Learn what your peers think about AWS Glue. Get advice and tips from experienced pros sharing their opinions. Updated: October 2025.
872,008 professionals have used our research since 2012.

What do I think about the stability of the solution?

The solution's stability is decent and rates higher than other products. It works well with Snowflake, Azure, GCP, and AWS-supported products. 

A hybrid situation may cause delays in performance. 

What do I think about the scalability of the solution?

The solution is scalable. 

How are customer service and support?

One of our customers used technical support and found them to be helpful. 

How was the initial setup?

The setup and installation is a bit complex. Training or advance knowledge is required. Someone with AWS experience or a DevOps perspective would have fewer issues. 

What about the implementation team?

We install the solution for customers and the timeline depends on the job. 

A complete project will take a few days to a week for deployment. The number of jobs and components determines how many technicians are required for setup, installation, and deployment. Technician requirements can range from two to fifteen. 

Deployment will take a couple of hours for a few announcement jobs that deploy from the CI/CD pipeline.

Which other solutions did I evaluate?

The solution is my second choice because I prefer Snowflake's capabilities. 

Which deployment model are you using for this solution?

Private Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
PeerSpot user
Cloud Data Engineer at jems groupe
Real User
Great for serverless data transformations but more resources are needed for running Spark jobs
Pros and Cons
  • "The solution is serverless so it allows us to transform data while optimizing the cost and performance of Spark jobs."
  • "The solution should offer features for streaming data in addition to batching data."

What is our primary use case?

Our company is creating data warehousing in the cloud. Our team includes four data engineers, two data ops, and two data administrators. 

We use S3 to data lake or prepare data from two databases that are contained in MySQL and Oracle. For the migration, we use DMS.

Then, we use the solution to perform data transformation. For Oracle, we use Data Catalog and Data Crawler to create our catalog. Dev Endpoint is used to develop complex data transformations. We then migrate to Studio Notebook where we develop and schedule a complex Spark job. 

Finally, we load the transformed data to Redshift so our data analyst team can visualize it with QuickSight. 

What is most valuable?

The solution is serverless so it allows us to transform data while optimizing the cost and performance of Spark jobs. 

The solution works with many data sources and services in the cloud. 

Glue Watch monitors our Spark jobs and immediately alerts us to issues so we are able to resolve them quickly. 

What needs improvement?

The solution does not work with Spark DataFrame. We can use the solution's DynamicFrame for this function but transformations are expensive. 

Not enough resources or services are available to run managed Spark jobs within the solution. We have reached out to Amazon many times regarding this issue. 

The solution should offer features for streaming data in addition to batching data. We can use other products such as Scala or Python but prefer the features be available in the solution. 

For how long have I used the solution?

I have been using the solution for one year. 

What do I think about the stability of the solution?

The solution is stable with no issues. 

What do I think about the scalability of the solution?

The solution is scalable. 

How are customer service and support?

Technical support has been good and has handled any issues. 

I rate technical support an eight out of ten. 

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

The solution is the best service in its category at this time. Based on project budget and use case, we use either the solution or EMR.

EMR is used for projects that require the latest version of Spark. 

We use the solution for any other versions of Spark. 

How was the initial setup?

I was not involved in the initial setup.

What's my experience with pricing, setup cost, and licensing?

The solution's pricing is based on DPUs so it is a good idea to optimize use or it can get expensive. 

I use Studio Notebook because it is less expensive and jobs can be deleted or clustered to run in one day. 

I rate pricing a four out of ten. 

Which other solutions did I evaluate?

Our company only uses Amazon cloud because other cloud environments do not offer the same features. 

The solution's Studio uses GCP which is easier than coding in Python Spark or Scala Spark. 

Azure Data Factory's features do not compare to what the solution can do in the cloud. 

What other advice do I have?

The solution is good for teams who do not want to worry about DevOps or who want to optimize cost by using the cloud. 

I rate the solution a seven out of ten. 

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
PeerSpot user
Buyer's Guide
AWS Glue
October 2025
Learn what your peers think about AWS Glue. Get advice and tips from experienced pros sharing their opinions. Updated: October 2025.
872,008 professionals have used our research since 2012.
Murilo Hallgren - PeerSpot reviewer
Data Engineer at a consultancy with self employed
Real User
Easy to use, simple configurations, and good documentation
Pros and Cons
  • "The most valuable feature of AWS Glue is its ease of use and good documentation. Additionally, we can do all the transformations that we need."
  • "The price of the solution could improve."

What is our primary use case?

We are using AWS Glue for transforming firewalls synced to the Data Lake in the bronze zone. The ATL uses the solution to transform fields in the silver layer and later we will produce the gold zone. We are using the Delta Lake Architecture.

What is most valuable?

The most valuable feature of AWS Glue is its ease of use and good documentation. Additionally, we can do all the transformations that we need.

What needs improvement?

The price of the solution could improve.

For how long have I used the solution?

I have been using AWS Glue for approximately one month.

What do I think about the stability of the solution?

The stability of AWS Glue is good.

What do I think about the scalability of the solution?

AWS Glue is highly scalable.

There are dozens of customers using this solution.

How are customer service and support?

I have not used the support from AWS Glue but I know their support is good.

Which solution did I use previously and why did I switch?

I have previously used Azure and Spark for testing.

How was the initial setup?

The initial setup of AWS Glue is simple. In other solutions, such as Spark, the configuration would take a lot longer.

What about the implementation team?

I did the deployment of AWS Glue myself with the AMS console. I am a data engineer.

What's my experience with pricing, setup cost, and licensing?

The overall cost of AWS Glue could be better. It cost approximately $1,000 a month. There is paid support available from AWS Glue.

If the cost of AWS Glue was 50 percent less then we would not move to another solution.

What other advice do I have?

I am moving to the EMR serverless or GCP solution.

I rate AWS Glue a nine out of ten.

Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Liana Iuhas - PeerSpot reviewer
CEO at Quark Technologies SRL
Real User
Highly scalable, reliable, and beneficial pay-as-you-go pricing model
Pros and Cons
  • "AWS Glue is a good solution for developers, they have the ability to write code in different languages and other software."
  • "The interface for AWS Glue could improve, they do not put a lot of details. You can write the code, in PySpark or in Scala, which is a big advantage, it is only easy to use for a developer. It will be difficult for new users to enter the cloud environment."

What is our primary use case?

My colleagues work with Spark, PySpark, and Scala as programming languages for writing complex aggregations. They have a repository in order to have a general view of all the sources and jobs on the platform and AWS Glue is very helpful.

What is most valuable?

AWS Glue is a good solution for developers, they have the ability to write code in different languages and other software.

What needs improvement?

The interface for AWS Glue could improve, they do not put a lot of details. You can write the code, in PySpark or in Scala, which is a big advantage, it is only easy to use for a developer. It will be difficult for new users to enter the cloud environment.

If business users want to run their own graphs they will not have the opportunity to use such features, such as running code inside AWS Glue in Spark, which will be complex for them.

For how long have I used the solution?

 I have been using AWS Glue for approximately four years.

What do I think about the stability of the solution?

AWS Glue is a highly stable solution. We didn't have bugs in production. 

The solution works well with Spark, which is a good framework for large volumes of data. It operates very well.

I rate the stability of AWS Glue a ten out of ten.

What do I think about the scalability of the solution?

The scalability of AWS Glue is great. It was used for enterprise customers. We worked a lot with AWS Glue for International companies.

We have approximately 10 people using AWS Glue in my company.

How are customer service and support?

I have to use the support from AWS Glue. The response time could improve.

I rate the support from AWS Glue a nine out of ten.

How would you rate customer service and support?

Positive

How was the initial setup?

The initial setup of AWS Glue is very simple.

What's my experience with pricing, setup cost, and licensing?

AWS Glue uses a pay-as-you-go approach which is helpful. The price of the overall solution is low and is a great advantage.

Which other solutions did I evaluate?

If I can compare AWS Glue to other solutions, it has the advantage of the cloud, which assures availability and scalability, and the pay-as-you-go is beneficial. This is why many companies are moving from their traditional ETL tools to the cloud because the costs will be reduced dramatically.

What other advice do I have?

I would recommend this solution to others.

I rate AWS Glue a nine out of ten.

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
PeerSpot user
Ankit  Shukla - PeerSpot reviewer
Data Engineer at YASH Technologies
Real User
Cheap, reliable, and able to expand as needed
Pros and Cons
  • "The solution is stable and reliable."
  • "The monitoring is not that good."

What is most valuable?

The best feature is the price point. It's pretty cheap as compared to other tools like Informatica, et cetera. That's why major companies are moving to the cloud and using Glue. At least, that's what I found.

The solution is stable and reliable.

You can scale the product if you need to. 

What needs improvement?

The monitoring is not that good. We'd like to see job progress be more clear. Right now, how we can view that is not that good. The is that mostly it is Python or Scala code based. The UX is lacking.

There is a bit of a learning curve, particularly during the setup process. 

More connectors should be included.

For how long have I used the solution?

I've been using the solution for three years. 

What do I think about the stability of the solution?

The solution is very reliable. It's stable. There are no bugs or glitches It works just fine. 

What do I think about the scalability of the solution?

The solution can scale very well. It's not a problem.

How are customer service and support?

Technical support is okay. We tend to go to the partner if we have issues, and they'll go to WS if they need to.

Which solution did I use previously and why did I switch?

I'm also familiar with Informatica. However, Glue is less expensive. 

How was the initial setup?

In terms of the initial setup, the learning part was a little bit stiff. After that, it is okay. We didn't have any issues once we understood the process. 

What about the implementation team?

We didn't require any outside assistance such as integrators or consultants. We were able to handle it ourselves. 

What's my experience with pricing, setup cost, and licensing?

The price is very good. It's enticing people to move to the cloud. 

That said, I do not have exact information on pricing. 

What other advice do I have?

I'm an AWS engineer. My company is a gold partner.

I'd rate the product eight out of ten. So far, it's quite good. I don't have any complaints.  

Which deployment model are you using for this solution?

Hybrid Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer. Partner
PeerSpot user
Suraj Sachdeva - PeerSpot reviewer
Data Engineer | Developer at Sakshath Technologies
Real User
Data integration solution that hosts metadata before the roll out of actual data
Pros and Cons
  • "The key role for Glue is that it hosts our metadata before rolling out our actual data. This is the major advantage of using this solution and our clients client have been very satisfied with it."
  • "The technical support for this solution could be improved. In future, we would like to connect more services like Athena or Kinesis to help control more loads of data."

What is our primary use case?

The key role of Glue is that it hosts our metadata before rolling out our actual data. This is the major advantage of using this solution and our clients client have been very satisfied with it.

What is most valuable?

The most valuable aspect of this solution is its automation and ability to sync data from the source to the solution phase. 

What needs improvement?

The technical support for this solution could be improved. In future, we would like to connect more services like Athena or Kinesis to help control more loads of data.

For how long have I used the solution?

I have been using this solution for three years. 

What do I think about the stability of the solution?

This is a stable solution. We have isolated the environment using containerization so that if anything goes wrong, we have higher levels of scalability and availability. To achieve this, we have configured multiple servers for testing, UAT and development.

What do I think about the scalability of the solution?

This is a scalable solution which is supported in our organization by Docker and Kubernetes. We have 2,000 users.

How are customer service and support?

We used a vendor with an internal IT team who provided us with architecture so that we could leverage those services and reach a solution. They have 50 people in the IT team, who continuously help us and monitor the things that we are working on.

How would you rate customer service and support?

Positive

How was the initial setup?

The initial setup was straightforward and took approximately one month. For deployment, we worked in two teams. One person handled all the scripting which we are developing for automation. Two other members handled the database and servers.

What's my experience with pricing, setup cost, and licensing?

This solution is affordable and there is an option to pay for the solution based on your usage. 

What other advice do I have?

I would rate this solution a seven out of ten. 

Which deployment model are you using for this solution?

Hybrid Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
reviewer1084386 - PeerSpot reviewer
ECM CONSULTANT/ARCHITECT/SOFTWARE DEVELOPER, DELUXE MN at a tech services company with 5,001-10,000 employees
Real User
Easy to perform ETL on multiple data sources, and easy to use after you learn it
Pros and Cons
  • "Glue is a NoSQL-based data ETL tool that has some advantages over IIS and ISAs."
  • "There is a learning curve to this tool."

What is our primary use case?

Glue is a NoSQL-based data ETL tool that has some advantages over IIS and ISAs. It is tailored and customized to use with SQL Server, which works very well in that platform.

If you want to use other data sources, the NoSQL concept makes it very easy, because missing data can be inserted as a new column or with null values.

That is not the case with many other tools. If you have on-premises tools, such as IIS, they don't manage missing data well.

What is most valuable?

If you want extremely high-performance functionality, you have to use both AWS Glue or Data Lake to store it in some temporary table. First, you will have to do some cleaning of the data, then if you need performance and speed, you have to use IIS with an IBM tool. 

You have to use the right tool in the right places. For example, if you're using Oracle, you have got to use the Oracle tools. If you are using SQL, you have to use the SQL tools. There is no other tool that provides the performance.

It's context-based and project-based. In the projects that I have used, it has worked well.

What needs improvement?

There is a learning curve to this tool.

For how long have I used the solution?

I have been working with AWS Glue for four years.

Everything runs on AWS, even if it belongs to a third party. For example, if you have a Netflix subscription, it runs on AWS. We have other products or vendor subscriptions that run on AWS.

What do I think about the stability of the solution?

Undoubtedly, the cloud is built to handle failure. If you have your devices, and your resources configured correctly, you won't have any issues. I haven't seen a problem.

How are customer service and support?

You have to pay for their technical support, and depending on which level of subscription, you will receive a call within an hour; otherwise, you will have to wait for days.

Which solution did I use previously and why did I switch?

We also use Azure's Data Lake, and I worked with Tipco in the past, though it's been a few years since we used it.

You should select the best tool for the job or the projects that are currently being worked on. Tipco was heavily used in the previous project we worked on.

How was the initial setup?

It takes some time to learn, but once you get the hang of it, you'll be fine. It's like any other IT tool, where nobody is an expert or isn't an expert, it is just the way you are exposed to a tool. 

You've chosen the right tool if you understand how the data works and what it needs to do. It's like going to Home Depot to get the right tool. You can purchase a set of tools, and it will work for you, but you will still need to purchase something else.

It's one of those tools in which someone must be an expert. After that, all tools and platforms become secondary.

What's my experience with pricing, setup cost, and licensing?

With AWS Glue, you pay more, but if you want to process the data, with speed and performance, you need the correct EC2 instances.

There is a price to pay. It doesn't come free.

Technical support is a paid service, and which subscription you have is dependent on that. You must pay one of them, and it ranges from $15,000 to $25,000 per year. 

You sign up for a level of service, and it does not come for free. As previously stated, everything is based on performance, ELAs.

It was very expensive, at that time. If a company wants to pay the money, it makes my job easier. However, if the company or enterprise does not have the funds to pay for it, then it is a hassle.

What other advice do I have?

In that environment, there is a lot going on. There are some things that you can get for free, and there are some add-ons that you can develop or use that have been tested. It's all about convenience and service. You will get what you pay for if you pay for what you want.

I'm not a fan of any tools; it all depends on the organization I work for, where their data is, what they want to do with it, how quickly they want to get there, and what their budget is, and you work around that. For me, I would not choose one over the other, unless I know the details of the project.

I would rate AWS Glue a nine out of ten.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
reviewer1526064 - PeerSpot reviewer
Associate Consultant at a tech vendor with 10,001+ employees
Real User
An extremely user-friendly and stable tool requiring an easy initial setup
Pros and Cons
  • "The solution is highly user-friendly, and its features are easy to use. The new addition of AWS Glue Data Catalog is also very beneficial, making the tool even more helpful for its users."
  • "The solution could be cheaper. The price of the solution is an area that needs improvement."

What is our primary use case?

Currently, we are utilizing AWS Glue for various ETL workloads, specifically in the life sciences domain. Our primary objective is to acquire data from various sources. Then, we store it in Redshift. This is where the complete use case of AWS Glue comes into the picture.

What is most valuable?

The solution is highly user-friendly, and its features are easy to use. The new addition of AWS Glue Data Catalog is also very beneficial, making the tool even more helpful for its users.

What needs improvement?

The solution could be cheaper. The price of the solution is an area that needs improvement.

For how long have I used the solution?

I have been using AWS Glue in my organization for a year. I am an end-user and a customer of the solution.

What do I think about the stability of the solution?

It is a stable solution. We have not faced any issues in the past year, so it's pretty stable. Stability-wise, I rate it a ten out of ten.

What do I think about the scalability of the solution?

The solution has proven to be scalable, and from my experience in the data engineering domain, I rate it an eight out of ten. It is worth noting that I may not be the most qualified person to provide a rating since I mostly manage and work on data-related tasks. Currently, approximately 20-25 people in our company use the solution.

How are customer service and support?

I had no experience with the technical support team of AWS Glue.

Which solution did I use previously and why did I switch?

Previously, I used Azure Data Factory. But I did not find it really helpful. And it was a bit complex. It was not that user-friendly. And I am much more comfortable with the AWS services as compared to Azure services.

How was the initial setup?

The initial setup of the solution is straightforward, and I find it easy to implement. I rate the setup process a nine on a scale of one to ten, where ten is the easiest. As for the deployment process, we usually request our platform team to handle it, and they are quite efficient in deploying and managing the infrastructure. Although I am not directly involved in the deployment process, my understanding is that it can be completed in just a few hours with the help of two to three team members. Our platform team consists of data engineers, architects, and platform engineers who cater to the needs of various projects and products within the AWS ecosystem. Fortunately, the solution does not require any maintenance.

What's my experience with pricing, setup cost, and licensing?

Price-wise, the solution is adequate, and we have no issues with it. We believe that the cost is justified given the number of users and the features it provides. Overall, it can be considered an average-priced tool. I would rate the solution a six or seven on a scale of one to ten, with ten being very expensive. Specifically, I rate its pricing a six out of ten.

Which other solutions did I evaluate?

Before choosing AWS Glue, I evaluated Azure Data Factory.

What other advice do I have?

I would tell those planning to use AWS Glue to try it. I rate the overall solution a ten out of ten.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: My company does not have a business relationship with this vendor other than being a customer.
PeerSpot user
Buyer's Guide
Download our free AWS Glue Report and get advice and tips from experienced pros sharing their opinions.
Updated: October 2025
Product Categories
Cloud Data Integration
Buyer's Guide
Download our free AWS Glue Report and get advice and tips from experienced pros sharing their opinions.