Try our new research platform with insights from 80,000+ expert users

Cassandra vs Cloudera Distribution for Hadoop comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Jan 7, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cassandra
Ranking in NoSQL Databases
6th
Average Rating
8.0
Reviews Sentiment
6.0
Number of Reviews
25
Ranking in other categories
Vector Databases (14th)
Cloudera Distribution for H...
Ranking in NoSQL Databases
8th
Average Rating
8.0
Reviews Sentiment
6.3
Number of Reviews
51
Ranking in other categories
Hadoop (1st)
 

Mindshare comparison

As of November 2025, in the NoSQL Databases category, the mindshare of Cassandra is 8.5%, down from 13.1% compared to the previous year. The mindshare of Cloudera Distribution for Hadoop is 3.0%, up from 2.4% compared to the previous year. It is calculated based on PeerSpot user engagement data.
NoSQL Databases Market Share Distribution
ProductMarket Share (%)
Cassandra8.5%
Cloudera Distribution for Hadoop3.0%
Other88.5%
NoSQL Databases
 

Featured Reviews

Monirul Islam Khan - PeerSpot reviewer
Has maintained secure document storage and efficient data distribution with peer-to-peer architecture
The functions or features in Cassandra that I have found most valuable are that it is a distributed system similar to Mongo. It's good enough for comparison with another SQL database, so it's smooth and organized for distributed database system. The peer-to-peer architecture in Cassandra is helpful for network decentralization, and I have already introduced that feature. Cassandra features in peer-to-peer as well as another monitoring, so basically, it's good enough for our service. The tunable consistency level in Cassandra is good, and we are using that feature already. In terms of built-in caching and lightweight transactions in Cassandra, the transaction level is good, and it's optimized, so there are no more issues in that database. Based on my experience, Cassandra is good for document management system, as well as distributed database system, and the automatic recovery process is there. Additionally, the database monitoring system or auditing system is well-comparable with other database systems, so we are actually happy to be using this Cassandra database.
Rok Dolinsek - PeerSpot reviewer
Enables on-premise implementation with powerful data processing capabilities
This is the only solution that is possible to install on-premise. Cloudera provides a hybrid solution that combines compute on cloud or on-premises. It includes all machine learning algorithms in the Spark machine learning library. All functionalities needed for a big data platform and ETL are on the platform, eliminating the need for other tools. It is scalable, ready for vertical scaling, and very powerful, offering numerous functionalities and configurations for generative AI.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Cassandra has some features that are more useful for specific use cases where you have time series where you have huge amounts of writes. That should be quick, but not specifically the reads. We needed to have quicker reads and writes and this is why we are using Cassandra right now."
"The most valuable features of Cassandra are the NoSQL database, high performance, and zero-copy streaming."
"Its retrieval is similar to an RDBMS, so our team finds it easy to adapt."
"The most valuable features are the counter features and the NoSQL schema. It also has good scalability. You can scale Cassandra to any finite level."
"Cassandra offers high availability and fault tolerance, making it suitable for large-scale data storage and real-time processing."
"We can add almost one million columns to the solution."
"The time series data was one of the best features along with auto publishing."
"Based on my experience, Cassandra is good for document management system, as well as distributed database system, and the automatic recovery process is there."
"The most valuable feature is Impala, the querying engine, which is very fast."
"Cloudera is a very manageable solution with good support."
"This is the only solution that is possible to install on-premise."
"The product is completely secure."
"The tool can be deployed using different container technologies, which makes it very scalable."
"We had a data warehouse before all the data. We can process a lot more data structures."
"We also really like the Cloudera community. You can have any question and will have your answer within a few hours."
"The most valuable feature is Kubernetes."
 

Cons

"Cassandra can improve by adding more built-in tools. For example, if you want to do some maintenance activities in the cluster, we have to depend on third-party tools. Having these tools build-in would be e benefit."
"We experience configuration issues when accommodating the volumes we require, which often necessitates consultation with the Cassandra development team."
"Depending upon our schema, we can't make ORDER BY or GROUP BY clauses in the product."
"The solution doesn't have joins between tables so you need other tools for that."
"Maybe they can improve their performance in data fetching from a high volume of data sets."
"I want Cassandra to update its open-source version more quickly. It's already feature-rich, but I'd appreciate better integration with other NoSQL databases like MariaDB or MongoDB. If I ever need to work with customers or vendors using different NoSQL databases, having native integration in Cassandra would make managing and interacting with their databases much easier."
"Cassandra could be more user-friendly like MongoDB."
"Fine-tuning was a bit of a challenge."
"The Cloudera training has deteriorated significantly."
"Cloudera Distribution for Hadoop has a limited feature list and a lot of costs involved."
"The one thing that we struggled with predominately was support. Because it was relatively new, support was always a big issue and I think it's still a bit of an ongoing concern with the team currently managing it."
"While the deployed product is generally functional, there are instances where it presents difficulties."
"The solution does not support multiple languages very well and this means users need to create work-arounds to implement some solutions."
"The governance aspect of the solution should be improved."
"The dashboard could be improved."
"The solution is not fit for on-premise distributions."
 

Pricing and Cost Advice

"Cassandra is a free open source solution, but there is a commercial version available called DataStax Enterprise."
"We are using the open-source version of Cassandra, the solution is free."
"I don't have the specific numbers on pricing, but it was fairly priced."
"We pay for a license."
"I use the tool's open-source version."
"There are licensing fees that must be paid, but I'm not sure if they are paid monthly or yearly."
"It is an expensive product."
"I wouldn't recommend CDH to others because of its high cost."
"The price is very high. The solution is expensive."
"The tool is expensive...For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"I haven't bought a license for this solution. I'm only using the Apache license version."
"Cloudera requires a license to use."
"The price could be better for the product."
"The pricing must be improved."
report
Use our free recommendation engine to learn which NoSQL Databases solutions are best for your needs.
873,085 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
Computer Software Company
10%
Retailer
7%
Comms Service Provider
6%
Educational Organization
19%
Financial Services Firm
18%
Computer Software Company
11%
Energy/Utilities Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
By reviewers
Company SizeCount
Small Business8
Midsize Enterprise1
Large Enterprise14
By reviewers
Company SizeCount
Small Business16
Midsize Enterprise9
Large Enterprise31
 

Questions from the Community

What do you like most about Cassandra?
The use of Cassandra in real-time data analytics has been pivotal for our e-commerce platform. As our platform operates 24/7, providing services to sellers and customers alike, the need for real-ti...
What needs improvement with Cassandra?
While Cassandra can handle NoSQL, I think there should be more flexibility for whole schema design when data is stored in wide columns. Additionally, I believe that eventual consistency should be e...
What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, unlike competitors' cloud-only solutions.
What needs improvement with Cloudera Distribution for Hadoop?
If they could support modifying the data more easily than the current implementation, it would be beneficial.
 

Overview

 

Sample Customers

1. Apple 2. Netflix 3. Facebook 4. Instagram 5. Twitter 6. eBay 7. Spotify 8. Uber 9. Airbnb 10. Adobe 11. Cisco 12. IBM 13. Microsoft 14. Yahoo 15. Reddit 16. Pinterest 17. Salesforce 18. LinkedIn 19. Hulu 20. Airbnb 21. Walmart 22. Target 23. Sony 24. Intel 25. Cisco 26. HP 27. Oracle 28. SAP 29. GE 30. Siemens 31. Volkswagen 32. Toyota
37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Find out what your peers are saying about Cassandra vs. Cloudera Distribution for Hadoop and other solutions. Updated: September 2025.
873,085 professionals have used our research since 2012.