Try our new research platform with insights from 80,000+ expert users

Amazon EMR vs Hortonworks Data Platform comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Amazon EMR
Ranking in Hadoop
3rd
Average Rating
7.8
Number of Reviews
21
Ranking in other categories
Cloud Data Warehouse (11th)
Hortonworks Data Platform
Ranking in Hadoop
6th
Average Rating
8.0
Reviews Sentiment
6.1
Number of Reviews
25
Ranking in other categories
Open Source Databases (15th), Data Management Platforms (DMP) (9th)
 

Featured Reviews

Quan Vu - PeerSpot reviewer
Provides efficient data processing features and has good scalability
We need to have a data pipeline tool to ensure consistent data processing for the initial setup. We create a framework, read the code, and execute it in a data catalog. The size of the maintenance team depends on the project and the use cases. Usually, one backup team of four or five DevOps executives takes care of the backend and database. We need to separate our environments into production and development. We use GitHub for source control, Jenkins for the deployment pipeline, and a standard CI/CD tool to deploy code changes into production. We need to develop a deployment framework so developers only need to provide the code for their projects. The underlying engine then deploys the code, reads it, addresses the EMR filter, executes it, and completes the data processing.
Prashant  Singh - PeerSpot reviewer
A good technology with an easy setup but is at end of life
The solution is fairly simple to set up. It's not too complex or difficult. If you know the solution, it's easy. However, there is a learning curve. If you don't know anything about it, it can be more complex. You can typically deploy it within a week. We have five or six people capable of handling a deployment.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Amazon EMR is a good solution that can be used to manage big data."
"We are using applications, such as Splunk, Livy, Hadoop, and Spark. We are using all of these applications in Amazon EMR and they're helping us a lot."
"The ability to resize the cluster is what really makes it stand out over other Hadoop and big data solutions."
"The solution helps us manage huge volumes of data."
"Amazon EMR's most valuable features are processing speed and data storage capacity."
"One of the valuable features about this solution is that it's managed services, so it's pretty stable, and scalable as much as you wish. It has all the necessary distributions. With some additional work, it's also possible to change to a Spark version with the latest version of EMR. It also has Hudi, so we are leveraging Apache Hudi on EMR for change data capture, so then it comes out-of-the-box in EMR."
"The initial setup is straightforward."
"The solution is scalable."
"Ranger for security; with Ranger we can manager user’s permissions/access controls very easily."
"The Hortonworks solution is so stable. It is working as a production system, without any error, without any downtime. If I have downtime, it is mostly caused by the hardware of the computers."
"Ambari Web UI: user-friendly."
"The scalability is the key reason why we are on this platform."
"Hortonworks should not be expensive at all to those looking into using it."
"The upgrades and patches must come from Hortonworks."
"The data platform is pretty neat. The workflow is also really good."
"It is a scalable platform."
 

Cons

"The problem for us is it starts very slow."
"The legacy versions of the solution are not supported in the new versions."
"Amazon EMR can improve by adding some features, such as megastore services and HiveServer2. Additionally, the user interface could be better, similar to what Apache service provides, cross-platform services."
"We don't have much control. If we have multiple users, if they want to scale up, the cost will go and increase and we don't know how we can restrict that price part."
"The most complicated thing is configuring to the cluster and ensure it's running correctly."
"As people are shifting from legacy solutions to other technologies, Amazon EMR needs to add more features that give more flexibility in managing user data."
"There is room for improvement in pricing."
"Amazon EMR is continuously improving, but maybe something like CI/CD out-of-the-box or integration with Prometheus Grafana."
"It's at end of life and no longer will there be improvements."
"It would also be nice if there were less coding involved."
"Since Cloudera acquired HDP, it's been bundled with CBH and HDP. However, the biggest challenge is cloud storage integration with Azure, GCP, and AWS."
"Deleting any service requires a lot of clean up, unlike Cloudera."
"More information could be there to simplify the process of running the product."
"I would like to see more support for containers such as Docker and OpenShift."
"The cost of the solution is high and there is room for improvement."
"Security and workload management need improvement."
 

Pricing and Cost Advice

"The product is not cheap, but it is not expensive."
"Amazon EMR's price is reasonable."
"You don't need to pay for licensing on a yearly or monthly basis, you only pay for what you use, in terms of underlying instances."
"The price of the solution is expensive."
"There is no need to pay extra for third-party software."
"There is a small fee for the EMR system, but major cost components are the underlying infrastructure resources which we actually use."
"Amazon EMR is not very expensive."
"The cost of Amazon EMR is very high."
"Currently, we are using the product in a sandbox environment, and there is no licensing. We might choose a licensing option once we get the results."
"It is priced well and it is affordable"
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
816,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
25%
Computer Software Company
13%
Manufacturing Company
9%
Educational Organization
7%
Computer Software Company
21%
Financial Services Firm
14%
University
9%
Government
9%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Amazon EMR?
Amazon EMR is a good solution that can be used to manage big data.
What is your experience regarding pricing and costs for Amazon EMR?
I rate the tool's pricing a five out of ten. It can be expensive since it's a managed service, and if you are not careful, you can run into unexpected charges. You can make a mistake that costs you...
What needs improvement with Amazon EMR?
The solution can become expensive if you are not careful.
What do you like most about Hortonworks Data Platform?
Distributed computing, secure containerization, and governance capabilities are the most valuable features.
What is your experience regarding pricing and costs for Hortonworks Data Platform?
I haven't done a price analysis specifically for HDP. However, when it was first introduced as Hadoop 2.0, there were a few use cases where the price was quite high. It was particularly expensive f...
What needs improvement with Hortonworks Data Platform?
Since Cloudera acquired HDP, it's been bundled with CBH and HDP. However, the biggest challenge is cloud storage integration with Azure, GCP, and AWS. These platforms offer competitive storage solu...
 

Also Known As

Amazon Elastic MapReduce
Hortonworks, HDP
 

Overview

 

Sample Customers

Yelp
Mayo Clinic, Symantec, Progressive Insurance, Noble Energy, Cardinal Health, Rogers, Mercy, Neustar, TRUECar, T-Mobile
Find out what your peers are saying about Amazon EMR vs. Hortonworks Data Platform and other solutions. Updated: October 2024.
816,406 professionals have used our research since 2012.