
Apache HBase vs Cassandra comparison

 

Comparison Buyer's Guide

Categories and Ranking

Apache HBase
Ranking in NoSQL Databases: 12th
Average Rating: 6.0
Number of Reviews: 2
Ranking in other categories: No ranking in other categories

Cassandra
Ranking in NoSQL Databases: 5th
Average Rating: 8.0
Reviews Sentiment: 4.4
Number of Reviews: 21
Ranking in other categories: Vector Databases (14th)
 

Mindshare comparison

As of November 2024, in the NoSQL Databases category, the mindshare of Apache HBase is 5.0%, down from 6.1% a year earlier. The mindshare of Cassandra is 13.6%, up from 12.0% a year earlier. Mindshare is calculated from PeerSpot user engagement data.
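To put the mindshare figures side by side, the short sketch below works out the year-over-year change for each product, both in percentage points and relative to the previous year's share. It uses only the four numbers quoted above; the names `mindshare`, `mindshare_change`, and `changes` are illustrative and do not reflect how PeerSpot computes anything.

```python
# Mindshare figures quoted above (percent of the NoSQL Databases category).
mindshare = {
    "Apache HBase": {"previous": 6.1, "current": 5.0},
    "Cassandra": {"previous": 12.0, "current": 13.6},
}


def mindshare_change(previous, current):
    """Return (change in percentage points, relative change in percent)."""
    points = current - previous
    relative = points / previous * 100
    return round(points, 1), round(relative, 1)


# Hypothetical helper mapping: product name -> (points, relative percent).
changes = {
    name: mindshare_change(values["previous"], values["current"])
    for name, values in mindshare.items()
}

for name, (points, relative) in changes.items():
    print(f"{name}: {points:+.1f} points ({relative:+.1f}% relative)")
```

Under these figures, Apache HBase lost about 1.1 points of mindshare while Cassandra gained about 1.6 points.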
 

Featured Reviews

Sekhar Reddy B - PeerSpot reviewer
Offers real-time aggregations and is easy for a beginner to learn
We use it for real-time data grouping. The most valuable part is the column family structure. We mainly use it for real-time aggregations. That's why we prefer it as a NoSQL database. We've seen performance issues when we have more regions. The product needs improvement in that area. So we…
Himanshu Amodwala - PeerSpot reviewer
Well-equipped to handle a massive influx of data and billions of requests
The use of Cassandra in real-time data analytics has been pivotal for our e-commerce platform. As our platform operates 24/7, providing services to sellers and customers alike, the need for real-time updates is paramount. For instance, when a customer leaves comments or feedback on an image, they anticipate an immediate reflection of these changes on the portal. Similarly, sellers altering product attributes or updating images expect instant visibility of these modifications. Handling large data volumes with Cassandra has been an excellent experience. Despite challenges related to the influx, these were not attributed to Cassandra itself but rather to middle-layer issues. Generally, it demonstrated scalability with workloads, thanks to its horizontal scaling capabilities. We could easily add new nodes to the system as needed, ensuring the platform coped well with increasing loads. The tool's most beneficial feature for scalability is its entire architecture. The absence of a single point of failure or a leader within the ecosystem contributes to its robust scalability. This key aspect influenced our decision to opt for the Cassandra ecosystem. In terms of performance, it demonstrated the ability to handle approximately 1.6 billion requests per day. This was achieved on AWS using EC2 instances, and it was during a period about four to five years ago.
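For a sense of scale, the reviewer's figure of roughly 1.6 billion requests per day can be converted to an average per-second rate. The sketch below assumes the load is spread evenly across the day, which the review does not state; real traffic on an e-commerce platform would likely peak well above this average. The constant and variable names are illustrative.

```python
# Figure quoted by the reviewer: approximately 1.6 billion requests per day.
REQUESTS_PER_DAY = 1_600_000_000
SECONDS_PER_DAY = 24 * 60 * 60

# Average rate only; assumes an even spread over 24 hours (an assumption,
# not something the review claims).
average_rps = REQUESTS_PER_DAY / SECONDS_PER_DAY

print(f"{average_rps:,.0f} requests per second on average")
```

That works out to roughly 18,500 requests per second on average.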

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Apache HBase is a database used for data storage."
"The most valuable part is the column family structure."
"Our primary use case for the solution is testing."
"I am satisfied with the performance."
"We can add almost one million columns to the solution."
"Cassandra has some features that are more useful for specific use cases where you have time series where you have huge amounts of writes. That should be quick, but not specifically the reads. We needed to have quicker reads and writes and this is why we are using Cassandra right now."
"The technical evaluation is very good."
"The most valuable features of this solution are its speed and distributed nature."
"Since I haven't had years of experience with it, it's still new to me. One valuable feature is its distribution, so I can run it partly in the cloud and part on-prem. That's a feature I'd like to use but haven't yet because we're trying to move to Azure. I don't know if or when that will happen. Ideally, we'd have it distributed over the cloud and on-prem simultaneously, so if something happens to our on-prem, we can keep going in the cloud, like a pay-as-you-go model with Azure."
"The most valuable features are the counter features and the NoSQL schema. It also has good scalability. You can scale Cassandra to any finite level."
 

Cons

"We've seen performance issues."
"I don't like using Apache HBase to store huge amounts of data because of many performance issues."
"Doesn't support a solution that can give aggregation."
"The solution doesn't have joins between tables so you need other tools for that."
"Depending upon our schema, we can't make ORDER BY or GROUP BY clauses in the product."
"Cassandra can improve by adding more built-in tools. For example, if you want to do some maintenance activities in the cluster, we have to depend on third-party tools. Having these tools built-in would be a benefit."
"The disc space is lacking. You need to free it up as you are working."
"Cassandra could be more user-friendly like MongoDB."
"Maybe they can improve their performance in data fetching from a high volume of data sets."
"I want Cassandra to update its open-source version more quickly. It's already feature-rich, but I'd appreciate better integration with other NoSQL databases like MariaDB or MongoDB. If I ever need to work with customers or vendors using different NoSQL databases, having native integration in Cassandra would make managing and interacting with their databases much easier."
 

Pricing and Cost Advice

Information not available
"There are licensing fees that must be paid, but I'm not sure if they are paid monthly or yearly."
"Cassandra is a free open source solution, but there is a commercial version available called DataStax Enterprise."
"I use the tool's open-source version."
"We pay for a license."
"We are using the open-source version of Cassandra, the solution is free."
"I don't have the specific numbers on pricing, but it was fairly priced."
816,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 21%
Computer Software Company: 16%
Manufacturing Company: 11%
Government: 8%

Financial Services Firm: 20%
Computer Software Company: 15%
Healthcare Company: 7%
Manufacturing Company: 5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about Apache HBase?
Apache HBase is a database used for data storage.
What needs improvement with Apache HBase?
We've seen performance issues when we have more regions. The product needs improvement in that area. So we experience performance issues sometimes when the load increases.
What advice do you have for others considering Apache HBase?
It's better to use AWS DynamoDB or Cassandra. I would rate it an eight out of ten. It is easy for a beginner to learn.
What do you like most about Cassandra?
The use of Cassandra in real-time data analytics has been pivotal for our e-commerce platform. As our platform operates 24/7, providing services to sellers and customers alike, the need for real-ti...
What needs improvement with Cassandra?
I want Cassandra to update its open-source version more quickly. It's already feature-rich, but I'd appreciate better integration with other NoSQL databases like MariaDB or MongoDB. If I ever need ...
 

Also Known As

HBase
No data available
 

Sample Customers

Bloomberg, Wells Fargo, Apple, Capital One, NVIDIA
Apple, Netflix, Facebook, Instagram, Twitter, eBay, Spotify, Uber, Airbnb, Adobe, Cisco, IBM, Microsoft, Yahoo, Reddit, Pinterest, Salesforce, LinkedIn, Hulu, Walmart, Target, Sony, Intel, HP, Oracle, SAP, GE, Siemens, Volkswagen, Toyota
Find out what your peers are saying about Apache HBase vs. Cassandra and other solutions. Updated: October 2024.