Apache HBase vs Cloudera Distribution for Hadoop comparison

 

Comparison Buyer's Guide

Executive Summary
Updated on Jan 7, 2025


Categories and Ranking

Apache HBase
Ranking in NoSQL Databases: 13th
Average Rating: 6.0
Reviews Sentiment: 6.3
Number of Reviews: 2
Ranking in other categories: None

Cloudera Distribution for H...
Ranking in NoSQL Databases: 8th
Average Rating: 8.0
Reviews Sentiment: 6.4
Number of Reviews: 49
Ranking in other categories: Hadoop (2nd)
 

Mindshare comparison

As of January 2025, in the NoSQL Databases category, Apache HBase holds a 5.1% mindshare, down from 5.7% the previous year. Cloudera Distribution for Hadoop holds a 2.3% mindshare, down from 3.0% the previous year. Mindshare is calculated from PeerSpot user engagement data.
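To put the two declines on a common footing, the short sketch below computes the change in percentage points and the relative change from the figures quoted above. The function name `mindshare_change` and the dictionary layout are illustrative assumptions, not part of any PeerSpot tooling.

```python
# Mindshare figures quoted in the comparison (percent of category engagement).
mindshare = {
    "Apache HBase": {"previous": 5.7, "current": 5.1},
    "Cloudera Distribution for Hadoop": {"previous": 3.0, "current": 2.3},
}


def mindshare_change(previous, current):
    """Return (percentage-point change, relative change in percent)."""
    point_change = current - previous
    relative_change = point_change / previous * 100
    # Round to one decimal place to match the precision of the source figures.
    return round(point_change, 1), round(relative_change, 1)


for product, values in mindshare.items():
    points, relative = mindshare_change(values["previous"], values["current"])
    print(f"{product}: {points:+.1f} points ({relative:+.1f}% relative)")
```

On these figures, Apache HBase lost 0.6 points (about 10.5% of its prior share), while Cloudera Distribution for Hadoop lost 0.7 points (about 23.3% of its prior share), so the relative decline is steeper for Cloudera even though the point changes are similar.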

Featured Reviews

Sekhar Reddy B - PeerSpot reviewer
Offers real-time aggregations and is easy for a beginner to learn
We use it for real-time data grouping. The most valuable part is the column family structure. We mainly use it for real-time aggregations. That's why we prefer it as a NoSQL database. We've seen performance issues when we have more regions. The product needs improvement in that area. So we…
Miodrag-Stanic - PeerSpot reviewer
You can manage all services from one place in an integrated manner
We switched to Airflow because Cloudera is outdated. It's not widely used. It would be good if we had the Spark 3.5. Spark is quite old. Cloudera is now offering an alternate solution as a replacement for AWS. AWS works badly with small files. The solution is not fit for on-premise distributions. It should be containerized so we can deploy it as containers within Kubernetes. We had one upgrade from CDH to CDP, which lasted for a long time. And I would expect with containerized deployment, it would be upgraded much more quickly than we had the experience.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Apache HBase is a database used for data storage."
"The most valuable part is the column family structure."
"Cloudera is a very manageable solution with good support."
"The scalability of Cloudera Distribution for Hadoop is excellent."
"With a cluster available, you can manage the security layer using the shared SDX - it provides flexibility."
"The solution is reliable and stable, it fits our requirements."
"It has the best proxy, security, and support features compared to open-source products."
"We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization."
"The product as a whole is good."
"The most valuable feature is Kubernetes."
 

Cons

"I don't like using Apache HBase to store huge amounts of data because of many performance issues."
"We've seen performance issues."
"The price of this solution could be lowered."
"The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. The Cloudera Machine Learning aspect could be tuned and enhanced to enable us to host some predictive analytics machine learning and AI use cases."
"Cloudera Distribution for Hadoop is not always completely stable in some cases, which can be a concern for big data solutions."
"It could be faster and more user-friendly."
"Cloudera's support is extremely bad and cannot be relied on."
"Cloudera Distribution for Hadoop has a limited feature list and a lot of costs involved."
"There is a maximum of a one-gigabyte block size, which is an area of storage that can be improved upon."
"The solution is not fit for on-premise distributions."
 

Pricing and Cost Advice

Apache HBase: Information not available
Cloudera Distribution for Hadoop:
"Cloudera Distribution for Hadoop is expensive, with support costs involved."
"The solution is fairly expensive."
"When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
"The price is very high. The solution is expensive."
"The tool is expensive...For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"I believe we pay for a three-year license."
"The pricing must be improved."
"Cloudera requires a license to use."
831,265 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews

Apache HBase
Financial Services Firm: 21%
Computer Software Company: 15%
Manufacturing Company: 10%
Government: 7%

Cloudera Distribution for Hadoop
Financial Services Firm: 23%
Computer Software Company: 14%
Educational Organization: 11%
Manufacturing Company: 9%
 

Company Size

By reviewers (categories: Large Enterprise, Midsize Enterprise, Small Business)
No data available
 

Questions from the Community

What do you like most about Apache HBase?
Apache HBase is a database used for data storage.
What needs improvement with Apache HBase?
We've seen performance issues when we have more regions. The product needs improvement in that area. So we experience performance issues sometimes when the load increases.
What advice do you have for others considering Apache HBase?
It's better to use AWS DynamoDB or Cassandra. I would rate it an eight out of ten. It is easy for a beginner to learn.
What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its...
What needs improvement with Cloudera Distribution for Hadoop?
The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. ...
 

Also Known As

Apache HBase: HBase
Cloudera Distribution for Hadoop: No data available
 

Sample Customers

Apache HBase: Bloomberg, Wells Fargo, Apple, Capital One, NVIDIA
Cloudera Distribution for Hadoop: 37signals, Adconion, adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC