Try our new research platform with insights from 80,000+ expert users

Amazon EMR vs Hortonworks Data Platform comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Amazon EMR
Ranking in Hadoop
3rd
Average Rating
7.8
Number of Reviews
21
Ranking in other categories
Cloud Data Warehouse (11th)
Hortonworks Data Platform
Ranking in Hadoop
6th
Average Rating
8.0
Number of Reviews
25
Ranking in other categories
Open Source Databases (15th), Data Management Platforms (DMP) (9th)
 

Featured Reviews

Quan Vu - PeerSpot reviewer
Aug 22, 2023
Provides efficient data processing features and has good scalability
We need to have a data pipeline tool to ensure consistent data processing for the initial setup. We create a framework, read the code, and execute it in a data catalog. The size of the maintenance team depends on the project and the use cases. Usually, one backup team of four or five DevOps executives takes care of the backend and database. We need to separate our environments into production and development. We use GitHub for source control, Jenkins for the deployment pipeline, and a standard CI/CD tool to deploy code changes into production. We need to develop a deployment framework so developers only need to provide the code for their projects. The underlying engine then deploys the code, reads it, addresses the EMR filter, executes it, and completes the data processing.
Leslie Mavonyani - PeerSpot reviewer
Aug 31, 2023
Helps with data management and has good scalability
We use Hortonworks Data Platform for data management, significant data ingestion, and analytics Hortonworks Data Platform has a limited user community. I haven't seen much discussion about user experiences. More information could be there to simplify the process of running the product. We have…

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The initial setup is pretty straightforward."
"In Amazon EMR it is easy to rebuild anything, easy to upgrade and has good fault tolerance."
"It has a variety of options and support systems."
"One of the valuable features about this solution is that it's managed services, so it's pretty stable, and scalable as much as you wish. It has all the necessary distributions. With some additional work, it's also possible to change to a Spark version with the latest version of EMR. It also has Hudi, so we are leveraging Apache Hudi on EMR for change data capture, so then it comes out-of-the-box in EMR."
"We are using applications, such as Splunk, Livy, Hadoop, and Spark. We are using all of these applications in Amazon EMR and they're helping us a lot."
"This is the best tool for hosts and it's really flexible and scalable."
"When we grade big jobs from on-prem to the cloud, we do it in EMR with Spark."
"The security of the managed workflow and the managed services are the best features for us. Since we inherited their security model and it's all managed services, those are the key benefits for our clients."
"The upgrades and patches must come from Hortonworks."
"Ambari Web UI: user-friendly."
"The Hortonworks solution is so stable. It is working as a production system, without any error, without any downtime. If I have downtime, it is mostly caused by the hardware of the computers."
"Hortonworks should not be expensive at all to those looking into using it."
"We use it for data science activities."
"Now, using this solution, it is much cheaper to have all of the data available for searching, not in real-time, but whenever there is a pending request."
"The data platform is pretty neat. The workflow is also really good."
"Ranger for security; with Ranger we can manager user’s permissions/access controls very easily."
 

Cons

"There is no need to pay extra for third-party software."
"The legacy versions of the solution are not supported in the new versions."
"The solution can become expensive if you are not careful."
"There is room for improvement in pricing."
"There were times where they would release new versions and it seemed to end up breaking old versions, which is very strange."
"The product's features for storing data in static clusters could be better."
"The problem for us is it starts very slow."
"We don't have much control. If we have multiple users, if they want to scale up, the cost will go and increase and we don't know how we can restrict that price part."
"It would also be nice if there were less coding involved."
"Hive performance. If Hive performance increased, Hadoop would replace (not everywhere) traditional databases."
"I would like to see more support for containers such as Docker and OpenShift."
"It's at end of life and no longer will there be improvements."
"Deleting any service requires a lot of clean up, unlike Cloudera."
"More information could be there to simplify the process of running the product."
"Since Cloudera acquired HDP, it's been bundled with CBH and HDP. However, the biggest challenge is cloud storage integration with Azure, GCP, and AWS."
"Security and workload management need improvement."
 

Pricing and Cost Advice

"You don't need to pay for licensing on a yearly or monthly basis, you only pay for what you use, in terms of underlying instances."
"I rate the tool's pricing a five out of ten. It can be expensive since it's a managed service, and if you are not careful, you can run into unexpected charges. You can make a mistake that costs you tens of thousands of dollars. That's happened to us twice, so I'm sensitive to it. We're still trying to work on that. Our smallest client probably spends a hundred thousand dollars yearly on licensing, while our largest is well over a million."
"There is a small fee for the EMR system, but major cost components are the underlying infrastructure resources which we actually use."
"There is no need to pay extra for third-party software."
"Amazon EMR is not very expensive."
"The price of the solution is expensive."
"The cost of Amazon EMR is very high."
"The product is not cheap, but it is not expensive."
"It is priced well and it is affordable"
"Currently, we are using the product in a sandbox environment, and there is no licensing. We might choose a licensing option once we get the results."
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
814,649 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
25%
Computer Software Company
13%
Manufacturing Company
9%
Educational Organization
7%
Computer Software Company
21%
Financial Services Firm
14%
Government
10%
University
9%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Amazon EMR?
Amazon EMR is a good solution that can be used to manage big data.
What is your experience regarding pricing and costs for Amazon EMR?
I rate the tool's pricing a five out of ten. It can be expensive since it's a managed service, and if you are not careful, you can run into unexpected charges. You can make a mistake that costs you...
What needs improvement with Amazon EMR?
The solution can become expensive if you are not careful.
What do you like most about Hortonworks Data Platform?
Distributed computing, secure containerization, and governance capabilities are the most valuable features.
What is your experience regarding pricing and costs for Hortonworks Data Platform?
I haven't done a price analysis specifically for HDP. However, when it was first introduced as Hadoop 2.0, there were a few use cases where the price was quite high. It was particularly expensive f...
What needs improvement with Hortonworks Data Platform?
Since Cloudera acquired HDP, it's been bundled with CBH and HDP. However, the biggest challenge is cloud storage integration with Azure, GCP, and AWS. These platforms offer competitive storage solu...
 

Also Known As

Amazon Elastic MapReduce
Hortonworks, HDP
 

Overview

 

Sample Customers

Yelp
Mayo Clinic, Symantec, Progressive Insurance, Noble Energy, Cardinal Health, Rogers, Mercy, Neustar, TRUECar, T-Mobile
Find out what your peers are saying about Amazon EMR vs. Hortonworks Data Platform and other solutions. Updated: October 2024.
814,649 professionals have used our research since 2012.