Try our new research platform with insights from 80,000+ expert users

Amazon EMR vs Amazon Redshift comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Amazon EMR
Ranking in Cloud Data Warehouse
11th
Average Rating
7.8
Number of Reviews
21
Ranking in other categories
Hadoop (3rd)
Amazon Redshift
Ranking in Cloud Data Warehouse
4th
Average Rating
7.8
Reviews Sentiment
7.4
Number of Reviews
66
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of November 2024, in the Cloud Data Warehouse category, the mindshare of Amazon EMR is 4.5%, up from 4.5% compared to the previous year. The mindshare of Amazon Redshift is 7.7%, down from 11.9% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Cloud Data Warehouse
 

Featured Reviews

Quan Vu - PeerSpot reviewer
Provides efficient data processing features and has good scalability
We need to have a data pipeline tool to ensure consistent data processing for the initial setup. We create a framework, read the code, and execute it in a data catalog. The size of the maintenance team depends on the project and the use cases. Usually, one backup team of four or five DevOps executives takes care of the backend and database. We need to separate our environments into production and development. We use GitHub for source control, Jenkins for the deployment pipeline, and a standard CI/CD tool to deploy code changes into production. We need to develop a deployment framework so developers only need to provide the code for their projects. The underlying engine then deploys the code, reads it, addresses the EMR filter, executes it, and completes the data processing.
Ved Prakash Yadav - PeerSpot reviewer
Works as a data warehouse system and collects data from different sources
In terms of improvement, I believe Amazon Redshift could work on reducing its costs, as they tend to increase significantly. Additionally, there are occasional issues with nodes going down, which can be problematic. We often encounter issues like someone dropping a column or changing the order of columns, which can cause synchronization problems when pushing data through our pipeline. It's a minor issue, but it can be annoying.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The solution helps us manage huge volumes of data."
"The initial setup is pretty straightforward."
"This is the best tool for hosts and it's really flexible and scalable."
"The solution is scalable."
"We are using applications, such as Splunk, Livy, Hadoop, and Spark. We are using all of these applications in Amazon EMR and they're helping us a lot."
"Amazon EMR is a good solution that can be used to manage big data."
"In Amazon EMR it is easy to rebuild anything, easy to upgrade and has good fault tolerance."
"The security of the managed workflow and the managed services are the best features for us. Since we inherited their security model and it's all managed services, those are the key benefits for our clients."
"With the APIs that are available, it can be easily integrated with other tools."
"It's very easy to migrate from other databases to Redshift. There are migration tools dedicated for this purpose, enabling migration from other databases like MS SQL directly to Redshift. The majority of the scripts will be automatically transposed."
"The most valuable feature is its scalability."
"It is quite simple to use and there are no issues with creating the tables."
"The ability to reload data multiple times at different times."
"Redshift is a major service of Amazon and is very scalable. It enables faster recalculations and data management, helping to retrieve data quickly."
"Easy to build out our snowflake design and load data."
"If the analyst knows SQL, which is comfortable and easy to use to go between all of these tool stacks, I think it's reliable. It's a secure and reliable data warehouse."
 

Cons

"The legacy versions of the solution are not supported in the new versions."
"There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange."
"As people are shifting from legacy solutions to other technologies, Amazon EMR needs to add more features that give more flexibility in managing user data."
"The dashboard management could be better. Right now, it's lacking a bit."
"The most complicated thing is configuring to the cluster and ensure it's running correctly."
"We don't have much control. If we have multiple users, if they want to scale up, the cost will go and increase and we don't know how we can restrict that price part."
"The problem for us is it starts very slow."
"There is no need to pay extra for third-party software."
"The explain panel in the Redshift database could be better."
"The solution could improve in handling more data formats and more native support for RDF."
"The OLAP slide and dice features need to be improved."
"The refreshment rate of data reaching Redshift from other sources should be faster."
"Query compilation time needs a lot of improvement for cases where you are generating queries dynamically."
"Planting is the primary key enforcement that should be improved."
"Pricing is one of the things that it could improve. It should be more competitive."
"The only minor issue I faced was that it took a bit longer than expected to change the cluster to have more space or storage."
 

Pricing and Cost Advice

"I rate the tool's pricing a five out of ten. It can be expensive since it's a managed service, and if you are not careful, you can run into unexpected charges. You can make a mistake that costs you tens of thousands of dollars. That's happened to us twice, so I'm sensitive to it. We're still trying to work on that. Our smallest client probably spends a hundred thousand dollars yearly on licensing, while our largest is well over a million."
"There is a small fee for the EMR system, but major cost components are the underlying infrastructure resources which we actually use."
"There is no need to pay extra for third-party software."
"The product is not cheap, but it is not expensive."
"The cost of Amazon EMR is very high."
"Amazon EMR's price is reasonable."
"You don't need to pay for licensing on a yearly or monthly basis, you only pay for what you use, in terms of underlying instances."
"Amazon EMR is not very expensive."
"If you want a fixed price, an to not worry about every query, but you need to manage your nodes personally, use Redshift."
"The price of Amazon Redshift is reasonable because it depends on the usage that you use and for DWH for the long term."
"The product is quite expensive."
"At the moment, pricing is a little bit on the higher side, although it depends on the size of the company."
"The cost will depend on how you set up your warehouse and what kind of data you store."
"This solution implements the pay-as-you-use model, so no license. Pricing could be cheaper."
"Pricing for Amazon Redshift is reasonable, though it could be somewhat higher than other solutions, such as Azure. Still, when you base your comparison on the services offered and the pricing, it's the most reasonable versus its competitors, such as RDS."
"BI is sold to our customer base as a part of the initial sales bundle. A customer may elect to opt for a white labeled site for an up-charge."
report
Use our free recommendation engine to learn which Cloud Data Warehouse solutions are best for your needs.
816,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
25%
Computer Software Company
13%
Manufacturing Company
9%
Educational Organization
7%
Educational Organization
60%
Financial Services Firm
7%
Computer Software Company
6%
Manufacturing Company
4%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Amazon EMR?
Amazon EMR is a good solution that can be used to manage big data.
What is your experience regarding pricing and costs for Amazon EMR?
I rate the tool's pricing a five out of ten. It can be expensive since it's a managed service, and if you are not careful, you can run into unexpected charges. You can make a mistake that costs you...
What needs improvement with Amazon EMR?
The solution can become expensive if you are not careful.
How does Amazon Redshift compare with Microsoft Azure Synapse Analytics?
Amazon Redshift is very fast, has a very good response time, and is very user-friendly. The initial setup is very straightforward. This solution can merge and integrate well with many different dat...
What do you like most about Amazon Redshift?
The tool's most valuable feature is its parallel processing capability. It can handle massive amounts of data, even when pushing hundreds of terabytes, and its scaling capabilities are good.
What is your experience regarding pricing and costs for Amazon Redshift?
You can start small with a basic cluster to learn and practice with it. Selecting the most basic and economical cluster type can save you enough money to move forward with the solution or go with a...
 

Also Known As

Amazon Elastic MapReduce
No data available
 

Overview

 

Sample Customers

Yelp
Liberty Mutual Insurance, 4Cite Marketing, BrandVerity, DNA Plc, Sirocco Systems, Gainsight, Blue 449
Find out what your peers are saying about Amazon EMR vs. Amazon Redshift and other solutions. Updated: October 2024.
816,406 professionals have used our research since 2012.