Amazon EMR vs Cloudera Distribution for Hadoop comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Amazon EMR
Ranking in Hadoop: 3rd
Average Rating: 7.8
Reviews Sentiment: 7.2
Number of Reviews: 22
Ranking in other categories: Cloud Data Warehouse (11th)

Cloudera Distribution for H...
Ranking in Hadoop: 2nd
Average Rating: 8.0
Reviews Sentiment: 6.4
Number of Reviews: 49
Ranking in other categories: NoSQL Databases (8th)
 

Mindshare comparison

As of December 2024, in the Hadoop category, the mindshare of Amazon EMR is 14.7%, down from 18.7% the previous year. The mindshare of Cloudera Distribution for Hadoop is 28.2%, up from 23.1% the previous year. Mindshare is calculated from PeerSpot user engagement data.
 

Featured Reviews

Prashant Singh - PeerSpot reviewer
Easy to manage and reliable but the cost is hard to control
The cost is increasing, and we are looking into how we can optimize the cost of EMR. We're comparing Cloudera running on AWS with AWS EMR. We don't have much control: if multiple users want to scale up, the cost increases, and we don't know how to restrict it.
Shahan Rehman - PeerSpot reviewer
Can host multiple technologies and help businesses with their AI initiatives
The ease or difficulty of setting up the product depends on the customer environment where the tool is deployed. If a banking, industrial, or retail sector firm is taken into consideration, the deployment process can range from simple to very complex, depending on the architecture, the size of the database maintained, and the applications to be hosted. For Cloudera Distribution for Hadoop, one has to go through the usual deployment process, as for any software product. You need separate environments before going into production: test and dev environments and a pre-production environment. You install and configure all the components in the test environment and then test them in the pre-production environment. Once UAT is done, you move them to the production environment. In general, it's a critical product deployed in a company.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"In Amazon EMR it is easy to rebuild anything, easy to upgrade and has good fault tolerance."
"The solution is scalable."
"The project management is very streamlined."
"The initial setup is straightforward."
"Amazon EMR is a good solution that can be used to manage big data."
"The solution is pretty simple to set up."
"The initial setup is pretty straightforward."
"It allows users to access the data through a web interface."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that."
"The tool can be deployed using different container technologies, which makes it very scalable."
"We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization."
"It is helpful to gather and process data."
"Very good end-to-end security features."
"The file system is a valuable feature."
"The most valuable feature is Impala, the querying engine, which is very fast."
"The search function is the most valuable aspect of the solution."
 

Cons

"There is no need to pay extra for third-party software."
"The problem for us is it starts very slow."
"As people are shifting from legacy solutions to other technologies, Amazon EMR needs to add more features that give more flexibility in managing user data."
"There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange."
"Spark jobs take longer on Amazon EMR compared to previous experiences."
"Amazon EMR is continuously improving, but maybe something like CI/CD out-of-the-box or integration with Prometheus Grafana."
"There is room for improvement in pricing."
"The dashboard management could be better. Right now, it's lacking a bit."
"The price of this solution could be lowered."
"There are multiple bugs when we update."
"It would be useful if Cloudera had more tools like SQL Engines that offer the traditional relational database. We have to do a lot of work preparing the data outside Cloudera before getting it into the platform."
"I would like to see an improvement in how the solution helps me to handle the whole cluster."
"The competitors provide better functionalities."
"This is a very expensive solution."
"While the deployed product is generally functional, there are instances where it presents difficulties."
"The initial setup of Cloudera is difficult."
 

Pricing and Cost Advice

"The price of the solution is expensive."
"The cost of Amazon EMR is very high."
"There is no need to pay extra for third-party software."
"Amazon EMR is not very expensive."
"I rate the tool's pricing a five out of ten. It can be expensive since it's a managed service, and if you are not careful, you can run into unexpected charges. You can make a mistake that costs you tens of thousands of dollars. That's happened to us twice, so I'm sensitive to it. We're still trying to work on that. Our smallest client probably spends a hundred thousand dollars yearly on licensing, while our largest is well over a million."
"Amazon EMR's price is reasonable."
"The product is not cheap, but it is not expensive."
"There is a small fee for the EMR system, but major cost components are the underlying infrastructure resources which we actually use."
"The solution is expensive."
"I believe we pay for a three-year license."
"Cloudera Distribution for Hadoop is expensive, with support costs involved."
"Cloudera requires a license to use."
"The solution is fairly expensive."
"The product’s price varies from project to project."
"It is an expensive product."
"I wouldn't recommend CDH to others because of its high cost."
824,067 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 25%
Computer Software Company: 13%
Manufacturing Company: 9%
Educational Organization: 7%

Financial Services Firm: 23%
Computer Software Company: 15%
Educational Organization: 11%
Manufacturing Company: 8%
 

Company Size

By reviewers: Large Enterprise, Midsize Enterprise, Small Business
 

Questions from the Community

What do you like most about Amazon EMR?
Amazon EMR is a good solution that can be used to manage big data.
What is your experience regarding pricing and costs for Amazon EMR?
The cost of Amazon EMR is a little bit expensive, especially considering the support package, which includes a gold package.
What needs improvement with Amazon EMR?
Spark jobs take longer on Amazon EMR compared to previous experiences. This aspect could be improved to make them more efficient.
What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its...
What needs improvement with Cloudera Distribution for Hadoop?
The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. ...
 

Also Known As

Amazon EMR: Amazon Elastic MapReduce
Cloudera Distribution for Hadoop: No data available
 

Sample Customers

Amazon EMR: Yelp
Cloudera Distribution for Hadoop: 37signals, Adconion, adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Find out what your peers are saying about Amazon EMR vs. Cloudera Distribution for Hadoop and other solutions. Updated: December 2024.