Try our new research platform with insights from 80,000+ expert users

AWS Data Pipeline [EOL] vs AWS Glue comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

AWS Data Pipeline [EOL]
Average Rating
8.0
Number of Reviews
2
Ranking in other categories
No ranking in other categories
AWS Glue
Average Rating
7.8
Reviews Sentiment
7.0
Number of Reviews
46
Ranking in other categories
Cloud Data Integration (1st)
 

Featured Reviews

Geoffrey Leigh - PeerSpot reviewer
A stable, scalable, and reliable solution for moving and processing data
We're only considering enhancing the presentation layer to give a more multidimensional OLAP view that AWS seems to have decided on. Redshift with the data mart structure is like an OLAP cube. Oracle Analytics Cloud is an over-code killer and is not what we need. I was looking at Mondrian, which used to be part of the open-source stack from another vendor that works. Still, I am also looking at some of the other OLAP environments like Kaiser and perhaps decided to go to Azure with Microsoft Azure analysis cloud, but that's not multidimensional either as SSAS used to be. We tried the Mondrian, and that didn't perform how we expected. So, we are looking at resetting something to perform as an OLAP in the cloud, particularly AWS, so that we might consider an Azure solution.
Ajaykumar Myana - PeerSpot reviewer
Provides serverless mechanism, easy data transformation and automated infrastructure management
We no longer had to worry much about infrastructure management because AWS Glue is serverless, and Amazon takes care of the underlying infrastructure. This allowed us to focus on the code and application logic without concerns about scaling, CPU management, or handling fluctuations in flow. The serverless nature of Glue jobs relieved us from these infrastructure-related worries.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"It is a stable solution...It is a scalable solution."
"The most valuable feature of the solution is that orchestration and development capabilities are easier with the tool."
"Its ease of use, cost-effectiveness, and highly secure architecture are some of the most valuable features."
"You do not need many frameworks to run Glue."
"The solution is serverless so it allows us to transform data while optimizing the cost and performance of Spark jobs."
"AWS Glue is very quick to start without cold starts, unlike AWS Lambda."
"What I like best about AWS Glue is its real-time data backup feature. Last week, there was a production push, and what used to take almost ten days to send out around fifty-six thousand emails now takes only two hours."
"The solution’s most valuable feature is the ETL job."
"It is AWS-integrated. There is end-to-end integration with the other AWS services. It is also user-friendly."
"Transformations are valuable because you can modify or override complex data logic from an open source or Spark to solve issues."
 

Cons

"It's almost semi-automatic because you must review and approve code push, which works well. Still, we had many problems getting there during the deployment process, but we got there."
"The user-defined functions have shortcomings in AWS Data Pipeline."
"The solution's visual ETL tool is of no use for actual implementation."
"The solution could be cheaper. The price of the solution is an area that needs improvement."
"There could be an enhanced way of managing pure metadata management or data cataloging."
"It would be better if it were more user-friendly. The interesting thing we found is that it was a little strange at the beginning. The way Glue works is not very straightforward. After trying different things, for example, we used just the console to create jobs. Then we realized that things were not working as expected. After researching and learning more, we realized that even though the console creates the script for the ETL processes, you need to modify or write your own script in Spark to do everything you want it to do. For example, we are pulling data from our source database and our application database, which is in Aurora. From there, we are doing the ETL to transform the data and write the results into Redshift. But what was surprising is that it's almost like whatever you want to do, you can do it with Glue because you have the option to put together your own script. Even though there are many functionalities and many connections, you have the opportunity to write your own queries to do whatever transformations you need to do. It's a little deceiving that some options are supposed to work in a certain way when you set them up in the console, but then they are not exactly working the right way or not as expected. It would be better if they provided more examples and more documentation on options."
"While working on AWS Glue, I could not find any training material for it."
"AWS Glue is more costly compared to other tools like Airflow."
"In terms of improvement, the performance of AWS Glue could be faster."
"I haven't looked into Glue in terms of seeking out flaws. I've not come across missing features."
 

Pricing and Cost Advice

"I rate the pricing between six to eight on a scale from one to ten, where one is low price, and ten is high price."
"The way we use it, I think it is fair as we're getting a good value for money compared to having a server or some other data pipeline."
"It is an expensive product. I rate its pricing a nine out of ten."
"The solution's pricing is based on DPUs so it is a good idea to optimize use or it can get expensive."
"Its price is good. We pay as we go or based on the usage, which is a good thing for us because it is simple to forecast for the tool. It is good in terms of the financial planning of the company, and it is a good way to estimate the cost. It is also simple for our clients. In my opinion, it is one of the best tools in the market for ETL processes because of the fact that you pay as you use, which separates it from other big tools such as PowerCenter, Pentaho Data Integration, and Talend."
"AWS Glue is a paid service that doesn't come under the free trial of AWS."
"The overall cost of AWS Glue could be better. It cost approximately $1,000 a month. There is paid support available from AWS Glue."
"Technical support is a paid service, and which subscription you have is dependent on that. You must pay one of them, and it ranges from $15,000 to $25,000 per year."
"I rate pricing an eight out of ten."
"AWS Glue is quite costly, especially for small organizations."
report
Use our free recommendation engine to learn which Cloud Data Integration solutions are best for your needs.
824,067 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Computer Software Company
21%
Financial Services Firm
20%
Government
8%
Wholesaler/Distributor
5%
Financial Services Firm
22%
Computer Software Company
13%
Manufacturing Company
8%
Insurance Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about AWS Data Pipeline?
The most valuable feature of the solution is that orchestration and development capabilities are easier with the tool.
What is your experience regarding pricing and costs for AWS Data Pipeline?
I rate the pricing between six to eight on a scale from one to ten, where one is low price, and ten is high price.
What needs improvement with AWS Data Pipeline?
The user-defined functions have shortcomings in AWS Data Pipeline. The user-defined functions could be one of the areas where I can write a custom function and embed it as a part of AWS Data Pipeli...
How do you select the right cloud ETL tool?
AWS Glue and Azure Data factory for ELT best performance cloud services.
How does Talend Open Studio compare with AWS Glue?
We reviewed AWS Glue before choosing Talend Open Studio. AWS Glue is the managed ETL (extract, transform, and load) from Amazon Web Services. AWS Glue enables AWS users to create and manage jobs in...
What are the most common use cases for AWS Glue?
AWS Glue's main use case is for allowing users to discover, prepare, move, and integrate data from multiple sources. The product lets you use this data for analytics, application development, or ma...
 

Overview

 

Sample Customers

bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
bp, Cerner, Expedia, Finra, HESS, intuit, Kellog's, Philips, TIME, workday
Find out what your peers are saying about Amazon Web Services (AWS), Informatica, Salesforce and others in Cloud Data Integration. Updated: December 2024.
824,067 professionals have used our research since 2012.