Try our new research platform with insights from 80,000+ expert users

Cloudera Distribution for Hadoop vs ScyllaDB comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Jan 7, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Distribution for H...
Ranking in NoSQL Databases
8th
Average Rating
8.0
Reviews Sentiment
6.4
Number of Reviews
49
Ranking in other categories
Hadoop (2nd)
ScyllaDB
Ranking in NoSQL Databases
2nd
Average Rating
7.8
Reviews Sentiment
7.0
Number of Reviews
12
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of January 2025, in the NoSQL Databases category, the mindshare of Cloudera Distribution for Hadoop is 2.3%, down from 3.0% compared to the previous year. The mindshare of ScyllaDB is 11.5%, up from 8.9% compared to the previous year. It is calculated based on PeerSpot user engagement data.
NoSQL Databases
 

Featured Reviews

Miodrag-Stanic - PeerSpot reviewer
You can manage all services from one place in an integrated manner
We switched to Airflow because Cloudera is outdated. It's not widely used. It would be good if we had the Spark 3.5. Spark is quite old. Cloudera is now offering an alternate solution as a replacement for AWS. AWS works badly with small files. The solution is not fit for on-premise distributions. It should be containerized so we can deploy it as containers within Kubernetes. We had one upgrade from CDH to CDP, which lasted for a long time. And I would expect with containerized deployment, it would be upgraded much more quickly than we had the experience.
ArpitShah - PeerSpot reviewer
Self-hosting complexity and the way ScyllaDB counts operations can be confusing and may not reflect actual usage
It seems we have better options available. So probably don't go for ScyllaDB. The reason is, first, it's very high. It's not as straightforward as, like, Postgres or ClickHouse to set up. It requires a complex setup. The other problem is what they call. For example, they will say that for up to a million operations, you experience this. But the problem is if they have nine servers, then your one operation is counted as nine operations, not one. So, even though you have one write, they count it as nine. It's like it's just not false premises. You can always host it yourself, but then it's way more complex. The benefits are not substantially more than those of other databases. It's not that it's slow or anything. It's good enough and all. But it's just that ClickHouse or other databases are simpler and faster and probably provide more features. So, I kind of burn out from the database, and that's why I would keep it small.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The product is completely secure."
"Customer service and support were able to fix whatever the issue was."
"Cloudera is a very manageable solution with good support."
"Very good end-to-end security features."
"We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that."
"I don't see any performance issues."
"The search function is the most valuable aspect of the solution."
"The most valuable feature is Impala, the querying engine, which is very fast."
"The documentation is good. It integrates easily with our existing data infrastructure."
"Firstly, if I update something, it's most likely to finish within milliseconds."
"The best features of ScyllaDB are how it synchronizes data and its failover system. There's a unique formula to decide the number of nodes you need and the minimum required, which I find helpful. It also offers encryption and supports APIs, making it great for distributed systems and scaling databases across different regions. While it's easy to use, having prior experience helps configure it properly. There are many configurations; if you don't understand them, you might mess up the design. So, understanding your system's needs, like whether it requires more read or write operations, is crucial for setting up the correct configuration."
"ScyllaDB is very fast, and I can use it for so many things."
"It is lightweight, and it requires less infrastructure."
"The performance aspects of Scylla are good, as always... A good point about Scylla is that it can be used extensively."
"The product's most valuable features are efficiency and reliability."
"The database is easy to use, fast, and accessible for applications because the API is straightforward."
 

Cons

"The areas of improvement depend on the scale of the project. For banking customers, security features and an essential budget for commercial licenses would be the top priority. Data regulation could be the most crucial for a project with extensive data or an extra use case."
"There is a maximum of a one-gigabyte block size, which is an area of storage that can be improved upon."
"The pricing needs to improve."
"The user infrastructure and user interface needs to be improved, as well as the performance. The GUI needs to be better."
"It would be useful if Cloudera had more tools like SQL Engines that offer the traditional relational database. We have to do a lot of work preparing the data outside Cloudera before getting it into the platform."
"The price of this solution could be lowered."
"The one thing that we struggled with predominately was support. Because it was relatively new, support was always a big issue and I think it's still a bit of an ongoing concern with the team currently managing it."
"The competitors provide better functionalities."
"The documentation of Scylla is an area with shortcomings and needs to be improved."
"Some of the regular commands in NoSQL do not work."
"The documentation is not well established for new developers."
"We faced several challenges while integrating ScyllaDB into our AWS environment. One common issue was that a security port wasn’t opened on one node, preventingdata synchronization across clusters. We noticed the data wasn’t syncing correctly when we saw different record counts in other regions. After investigating, we found that the port was closed in one AWS region. Once we opened the port, the data synchronization across all nodes resumed as expected."
"From a sales pitch standpoint, it needs to deliver on promises of better ROI and compaction."
"ScyllaDB needs to improve its handling of transactions."
"If you don't have the best computing resources, then it's not easy to set up. In such cases, we have to run ScyllaDB in developer mode."
"The product needs to add more features and improve the response time of the support team."
 

Pricing and Cost Advice

"The price is very high. The solution is expensive."
"I wouldn't recommend CDH to others because of its high cost."
"The product’s price depends from project to project."
"Cloudera requires a license to use."
"The pricing must be improved."
"The tool is expensive...For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
"I believe we pay for a three-year license."
"It's a bit expensive."
"It's free."
"It is an expensive tool compared to its competitor."
"The paid version of ScyllaDB is not that expensive. The main advantage of the paid version is direct support from the ScyllaDB team, which can resolve issues faster—typically within a day, compared to two to three days with the free version. The paid version also offers better guidance and support, while the free version has good documentation and is more high-level. I’d rate their support team nine out of ten because of the quick responses from their community."
"I believe that there is a yearly licensing cost and that it's expensive."
report
Use our free recommendation engine to learn which NoSQL Databases solutions are best for your needs.
831,265 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
23%
Computer Software Company
14%
Educational Organization
11%
Manufacturing Company
9%
Computer Software Company
18%
Financial Services Firm
14%
Media Company
6%
Comms Service Provider
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its...
What needs improvement with Cloudera Distribution for Hadoop?
The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. ...
What do you like most about Scylla?
The performance aspects of Scylla are good, as always... A good point about Scylla is that it can be used extensively.
What is your experience regarding pricing and costs for Scylla?
The enterprise version comes with a cost of about $300,000 per year, however, we did not experience the promised compaction benefits.
What needs improvement with Scylla?
From a sales pitch standpoint, it needs to deliver on promises of better ROI and compaction. Additionally, ticketing and support systems could be improved due to the time it takes to get answers. T...
 

Learn More

 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
IBM, Investing.com, mParticle, Comcast, GE, Fanatics, Ola, CERN, adgear, Samsung
Find out what your peers are saying about Cloudera Distribution for Hadoop vs. ScyllaDB and other solutions. Updated: January 2025.
831,265 professionals have used our research since 2012.