Try our new research platform with insights from 80,000+ expert users
Cloud Data Engineer at jems groupe
Real User
Great for serverless data transformations but more resources are needed for running Spark jobs
Pros and Cons
  • "The solution is serverless so it allows us to transform data while optimizing the cost and performance of Spark jobs."
  • "The solution should offer features for streaming data in addition to batching data."

What is our primary use case?

Our company is creating data warehousing in the cloud. Our team includes four data engineers, two data ops, and two data administrators. 

We use S3 to data lake or prepare data from two databases that are contained in MySQL and Oracle. For the migration, we use DMS.

Then, we use the solution to perform data transformation. For Oracle, we use Data Catalog and Data Crawler to create our catalog. Dev Endpoint is used to develop complex data transformations. We then migrate to Studio Notebook where we develop and schedule a complex Spark job. 

Finally, we load the transformed data to Redshift so our data analyst team can visualize it with QuickSight. 

What is most valuable?

The solution is serverless so it allows us to transform data while optimizing the cost and performance of Spark jobs. 

The solution works with many data sources and services in the cloud. 

Glue Watch monitors our Spark jobs and immediately alerts us to issues so we are able to resolve them quickly. 

What needs improvement?

The solution does not work with Spark DataFrame. We can use the solution's DynamicFrame for this function but transformations are expensive. 

Not enough resources or services are available to run managed Spark jobs within the solution. We have reached out to Amazon many times regarding this issue. 

The solution should offer features for streaming data in addition to batching data. We can use other products such as Scala or Python but prefer the features be available in the solution. 

For how long have I used the solution?

I have been using the solution for one year. 

Buyer's Guide
AWS Glue
March 2025
Learn what your peers think about AWS Glue. Get advice and tips from experienced pros sharing their opinions. Updated: March 2025.
842,651 professionals have used our research since 2012.

What do I think about the stability of the solution?

The solution is stable with no issues. 

What do I think about the scalability of the solution?

The solution is scalable. 

How are customer service and support?

Technical support has been good and has handled any issues. 

I rate technical support an eight out of ten. 

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

The solution is the best service in its category at this time. Based on project budget and use case, we use either the solution or EMR.

EMR is used for projects that require the latest version of Spark. 

We use the solution for any other versions of Spark. 

How was the initial setup?

I was not involved in the initial setup.

What's my experience with pricing, setup cost, and licensing?

The solution's pricing is based on DPUs so it is a good idea to optimize use or it can get expensive. 

I use Studio Notebook because it is less expensive and jobs can be deleted or clustered to run in one day. 

I rate pricing a four out of ten. 

Which other solutions did I evaluate?

Our company only uses Amazon cloud because other cloud environments do not offer the same features. 

The solution's Studio uses GCP which is easier than coding in Python Spark or Scala Spark. 

Azure Data Factory's features do not compare to what the solution can do in the cloud. 

What other advice do I have?

The solution is good for teams who do not want to worry about DevOps or who want to optimize cost by using the cloud. 

I rate the solution a seven out of ten. 

Which deployment model are you using for this solution?

Public Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
PeerSpot user
Suraj Sachdeva - PeerSpot reviewer
Data Engineer | Developer at Sakshath Technologies
Real User
Data integration solution that hosts metadata before the roll out of actual data
Pros and Cons
  • "The key role for Glue is that it hosts our metadata before rolling out our actual data. This is the major advantage of using this solution and our clients client have been very satisfied with it."
  • "The technical support for this solution could be improved. In future, we would like to connect more services like Athena or Kinesis to help control more loads of data."

What is our primary use case?

The key role of Glue is that it hosts our metadata before rolling out our actual data. This is the major advantage of using this solution and our clients client have been very satisfied with it.

What is most valuable?

The most valuable aspect of this solution is its automation and ability to sync data from the source to the solution phase. 

What needs improvement?

The technical support for this solution could be improved. In future, we would like to connect more services like Athena or Kinesis to help control more loads of data.

For how long have I used the solution?

I have been using this solution for three years. 

What do I think about the stability of the solution?

This is a stable solution. We have isolated the environment using containerization so that if anything goes wrong, we have higher levels of scalability and availability. To achieve this, we have configured multiple servers for testing, UAT and development.

What do I think about the scalability of the solution?

This is a scalable solution which is supported in our organization by Docker and Kubernetes. We have 2,000 users.

How are customer service and support?

We used a vendor with an internal IT team who provided us with architecture so that we could leverage those services and reach a solution. They have 50 people in the IT team, who continuously help us and monitor the things that we are working on.

How would you rate customer service and support?

Positive

How was the initial setup?

The initial setup was straightforward and took approximately one month. For deployment, we worked in two teams. One person handled all the scripting which we are developing for automation. Two other members handled the database and servers.

What's my experience with pricing, setup cost, and licensing?

This solution is affordable and there is an option to pay for the solution based on your usage. 

What other advice do I have?

I would rate this solution a seven out of ten. 

Which deployment model are you using for this solution?

Hybrid Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Buyer's Guide
AWS Glue
March 2025
Learn what your peers think about AWS Glue. Get advice and tips from experienced pros sharing their opinions. Updated: March 2025.
842,651 professionals have used our research since 2012.
reviewer2070318 - PeerSpot reviewer
Manager at a construction company with 51-200 employees
Real User
Excellent capabilities, proven stability, however would like a more robust interface on the no-code side
Pros and Cons
  • "We have found it beneficial when moving data from one source to another."
  • "I would like to see a more robust interface on the no-code side. This would be nice to be able to split cells."

What is our primary use case?

Our primary use case is ETL.

How has it helped my organization?

We have found it beneficial when moving data from one source to another.

What is most valuable?

The most valuable feature In terms of convenience, the drag-and-drop is really nice. The no-code interface, is really nice, being able to drag in my connectors. And then the nice thing, as well, is that it generates the framework, the wireframe of your code, so then you can just input whatever Spark or Python you want to input to make any further transformations.

What needs improvement?

I would like to see in general, documentation, on the limitations on which loads you can actually pull in when you are running Python. The additional Python Jupyter Notebook now has been nice. But yeah, generally speaking, you can not import every LOB. You can import branders now and you can use photos, but you can not import a lot of the other sorts of statistical-based loads. That is an issue currently. I would like to see a more robust interface on the no-code side. This would be nice to be able to split cells.

For how long have I used the solution?

I have been using AWS Glue for the past three years.

What do I think about the stability of the solution?

The stability is excellent.

What do I think about the scalability of the solution?

There is good scalability you can set up your minimum and maximum users and you are ready to implement.

How was the initial setup?

The initial setup is straightforward If you are just doing a file format conversion, then it is very simple, but if you want to do a little bit more robust sort of transformations, like inserting transformations or you want to do transformations on multiple delimiters, then there is a bit of learning curve. The deployment time is literally minutes.

What other advice do I have?

I would rate AWS Glue a seven on a scale of one to ten.

Which deployment model are you using for this solution?

Public Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Team Lead at a financial services firm with 5,001-10,000 employees
Real User
It can generate the code and has a good user interface, but it lacks Java support
Pros and Cons
  • "Its user interface is quite good. You just need to choose some options to create a job in AWS Glue. The code-generation feature is also useful. If you don't want to customize it and simply want to read a file and store the data in the database, it can generate the code for you."
  • "Currently, it supports only two languages in the background: Python and Scala. From our customization point of view, it would be helpful if it can also support Java in the background."

What is our primary use case?

We are using it for file ingestion. Its primary role is to ingest a file from a vendor to a database.

What is most valuable?

Its user interface is quite good. You just need to choose some options to create a job in AWS Glue. 

The code-generation feature is also useful. If you don't want to customize it and simply want to read a file and store the data in the database, it can generate the code for you.

What needs improvement?

Currently, it supports only two languages in the background: Python and Scala. From our customization point of view, it would be helpful if it can also support Java in the background.

For how long have I used the solution?

I have been using AWS Glue for three months. We have just started using these services.

What do I think about the stability of the solution?

We have not been using AWS Glue for a long time. Till now, we haven't found any issues.

How are customer service and technical support?

Their technical support is good. We faced an issue with AWS Glue where we had to read a flat file. In a flat file, you only have spaces. You don't have commas or anything else. AWS Glue does not directly support flat files. You need to provide it with an expression to read the file, and that expression itself has some limitation of characters. We contacted the AWS support team. They had a call with us and first tried to understand our problem and then our use case. We gave them some sample files for our use case, and they come up with a solution for this limitation. There are some custom patterns in AWS Glue that can be used. Even though they took some time, they provided the solution. If you give a file today, they will take three to four days to get back.

How was the initial setup?

It was straightforward. A lot of documentation is available on the AWS website, which can guide you through the simple steps to set it up. Its setup was easy for me.

What's my experience with pricing, setup cost, and licensing?

It is not expensive. AWS Glue works on the serverless architecture. We get charged for the time the server is up. For our use case, we have to use it once in a day, and it is not expensive for us.

What other advice do I have?

We have just recently started to use this solution. We haven't used all features properly. It is good for the features we are using. We did not find any drawbacks or limitations so far. We are already getting whatever we want from it. 

I would rate AWS Glue a seven out of ten. It needs improvements in terms of Java support and the turnaround time for our problems. 

Which deployment model are you using for this solution?

Public Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
reviewer2116998 - PeerSpot reviewer
Consultant Data junior at a computer software company with 51-200 employees
Consultant
User-friendly visual interface, but only a few built-in transformations
Pros and Cons
  • "The most valuable feature for me is the visual interface of AWS Glue."
  • "The product has only a few built-in transformations."

What is our primary use case?

The primary use cases of AWS Glue in our organization are for implementing ETL processes and for data flow.

What is most valuable?

The most valuable feature for me is the visual interface of AWS Glue. It is user-friendly and it is not complicated. Moreover, the coding part of AWS Glue allows users to upload their scripts after dropping some components. The product has flexibility and scalability, which is common in most cloud tools.

What needs improvement?

The product has only a few built-in transformations; additional custom-building transformations could be improved in the next release.

For additional features, I would like documentation on the equivalent of legacy ETL tools and their equivalent in AWS to make it easier for users to migrate their ETL processing to the cloud. It would save time and help users find the best transformation or solution to satisfy their new business needs.

For how long have I used the solution?

I have been using this solution for three months, and I am using the latest version.

What do I think about the stability of the solution?

The stability is good; I have not faced any crashes so far.

What do I think about the scalability of the solution?

I would rate its scalability a seven out of ten.

Which solution did I use previously and why did I switch?

I used a product called SysTrack. For me, it was just a switch from SysTrack to AWS Glue.

What's my experience with pricing, setup cost, and licensing?

The pricing depends on the usage, such as the number of users, computers, and the time jobs run.

What other advice do I have?

Overall, I would rate this product a seven out of ten. It is a good product, but I have not experienced all the additional features.

Which deployment model are you using for this solution?

Private Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Diksha  Hirole - PeerSpot reviewer
Data Engineer at BlazeClan Technologies
MSP
Top 10
Great for ETL and batch processing
Pros and Cons
  • "AWS Glue's most valuable features are the data catalog, including crawlers and tables, and Glue Studio, which means you don't have to use custom code."
  • "If there's a cluster-related configuration, we have to make worker notes, which is quite a headache when processing a large amount of data."

What is our primary use case?

I mainly use AWS Glue for ETL purposes and batch processing of data.

What is most valuable?

AWS Glue's most valuable features are the data catalog, including crawlers and tables, and Glue Studio, which means you don't have to use custom code.

What needs improvement?

There are a couple of issues with AWS Glue. First, AWS Control randomly logs off, which disturbs coding. Second, if there's a cluster-related configuration, we have to make worker notes, which is quite a headache when processing a large amount of data. In the next release, AWS Glue should include more transformations with AWS Studio.

For how long have I used the solution?

I've been using AWS Glue for around eight months.

What do I think about the stability of the solution?

AWS Glue is stable.

How are customer service and support?

AWS' technical support responds within an hour on email.

How was the initial setup?

The initial setup was very easy, with only some minimal configuration. However, there is a drawback that once we file the name of a user, it can't be changed.

What's my experience with pricing, setup cost, and licensing?

AWS Glue is quite costly, especially for small organizations. The licensing fee is around $200 per year.

What other advice do I have?

Glue supports Spark, so if you have a team that's good with Spark, definitely go with Glue. I would rate AWS Glue as eight out of ten.

Which deployment model are you using for this solution?

Private Cloud

If public cloud, private cloud, or hybrid cloud, which cloud provider do you use?

Amazon Web Services (AWS)
Disclosure: My company has a business relationship with this vendor other than being a customer: Partner
PeerSpot user
reviewer1688958 - PeerSpot reviewer
Net Full-Stack developer at a tech services company with 201-500 employees
Real User
A stable solution which can easily integrate with other AWS services
Pros and Cons
  • "One of the best features of the solution is its ability to easily integrate with other AWS services."
  • "Overall, I consider the technical support to be fine, although the response time could be faster in certain cases."

What is our primary use case?

We use the solution as a level of loading data from the source systems.

What is most valuable?

One of the best features of the solution is its ability to easily integrate with other AWS services. So, it's like we are using AWS as a main cloud provider. It's easy to put everything together. it is very flexible when it comes to compute features. We find the solution very useful when we make use of certain scripts. In some cases, it allows us to get rid of duplicates.

What needs improvement?

When there is a need to configure connections to different database sources in respect of the target, it would be good if it were easier to deal with roles. I am referring to the need to configure connections in a different target process, something which would require a certain time outlay for configuring VPC and checking that everything is okay, in respect of the creation of required roles. It would save time were this process to be made easier and more user friendly. 

The technical support depends on the type of question, whether there is a need to understand additional inter-related information on multiple levels. Overall, I consider the technical support to be fine, although the response time could be faster in certain cases. 

For how long have I used the solution?

I have been using AWS Glue for about two years.

What do I think about the stability of the solution?

The solution is stable. 

How are customer service and support?

While the technical support can vary with the type of question, I feel that, overall, it is okay, although receipt of information could be faster in certain cases. 

Which solution did I use previously and why did I switch?

We previously had experience with Database Migration Service at AWS. I recommend it over AWS Glue if one needs to do full database migration from on-premises deployment or in cases involving large volumes of data.

How was the initial setup?

I handled the installation on my own. 

What's my experience with pricing, setup cost, and licensing?

I consider the the price to be standard-plus when it comes to optimal usage. 

What other advice do I have?

I rate AWS Glue as an eight out of ten. 

Which deployment model are you using for this solution?

Hybrid Cloud
Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
reviewer907167 - PeerSpot reviewer
Cloud Solution Architect at a tech services company with 1-10 employees
Real User
Top 10
Cost-effective and stable
Pros and Cons
  • "I appreciate AWS Glue for its cost-effectiveness."
  • "In terms of improvement, the performance of AWS Glue could be faster."

What is our primary use case?

AWS Glue is a versatile tool and we mostly use it for "lift and shift" server migrations.

What is most valuable?

I appreciate AWS Glue for its cost-effectiveness. The service provides a good balance between its capabilities and the cost associated with using it.

What needs improvement?

In terms of improvement, the performance of AWS Glue could be faster.

For how long have I used the solution?

I have been using AWS Glue for five years.

What do I think about the stability of the solution?

It is a stable product.

What do I think about the scalability of the solution?

It is fairly scalable.

How are customer service and support?

The partner program support is very good.

How was the initial setup?

The initial setup is not too complex. To deploy and maintain a data platform, a general data team of around four to five skilled individuals is typically required.

What's my experience with pricing, setup cost, and licensing?

For AWS Glue, there is no separate license fee. It is part of the AWS service, and you pay for its usage as part of your overall AWS bill.

What other advice do I have?

Overall, I would rate AWS Glue as an eight out of ten.

Disclosure: I am a real user, and this review is based on my own experience and opinions.
PeerSpot user
Buyer's Guide
Download our free AWS Glue Report and get advice and tips from experienced pros sharing their opinions.
Updated: March 2025
Product Categories
Cloud Data Integration
Buyer's Guide
Download our free AWS Glue Report and get advice and tips from experienced pros sharing their opinions.