Try our new research platform with insights from 80,000+ expert users

Cloudera Distribution for Hadoop vs SingleStore comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Distribution for H...
Average Rating
8.0
Reviews Sentiment
6.4
Number of Reviews
49
Ranking in other categories
Hadoop (2nd), NoSQL Databases (8th)
SingleStore
Average Rating
8.8
Reviews Sentiment
7.1
Number of Reviews
7
Ranking in other categories
Database as a Service (DBaaS) (7th), Vector Databases (13th)
 

Mindshare comparison

While both are Databases solutions, they serve different purposes. Cloudera Distribution for Hadoop is designed for Hadoop and holds a mindshare of 27.8%, up 23.5% compared to last year.
SingleStore, on the other hand, focuses on Database as a Service (DBaaS), holds 1.4% mindshare, up 0.8% since last year.
Hadoop
Database as a Service (DBaaS)
 

Featured Reviews

Miodrag-Stanic - PeerSpot reviewer
You can manage all services from one place in an integrated manner
We switched to Airflow because Cloudera is outdated. It's not widely used. It would be good if we had the Spark 3.5. Spark is quite old. Cloudera is now offering an alternate solution as a replacement for AWS. AWS works badly with small files. The solution is not fit for on-premise distributions. It should be containerized so we can deploy it as containers within Kubernetes. We had one upgrade from CDH to CDP, which lasted for a long time. And I would expect with containerized deployment, it would be upgraded much more quickly than we had the experience.
Hitesh Kunchakuri - PeerSpot reviewer
A reasonably priced product that offers good speed and seamless support
Currently, I can't think of any areas that require improvement because SingleStore was recently launched in the market. The product can be developed further to provide more appropriate output to users as it is one of the areas where there are shortcomings. The current SingleStore model provides output based on the RANK function. If a user searches for a liquor bottle, then with all the data the product has, it will search for the liquor bottle in the data, and based on a match, the product has an algorithm to rank the product because of which the paragraph that has the best match will be ranked as a 100, the next one as 99, following which the next product will be ranked as 98 and so on. The output from the solution will fetch you all the 100 products that are available in a store, but sometimes a user might require a product with a 97 or 98 percent match from the DB, meaning the product doesn't always work to provide a 100 percent match, an area I feel that can be optimized in the product. Currently, SingleStore's features are excellent as it can read documents, images, and everything. The product works seamlessly for me.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Provides a viable open-source solution for enterprise implementations and reliable, intelligent data analysis."
"Cloudera is a very manageable solution with good support."
"Customer service and support were able to fix whatever the issue was."
"CDH has a wide variety of proprietary tools that we use, like Impala. So from that perspective, it's quite useful as opposed to something open-source. We get a lot of value from Cloudera's proprietary tools."
"The search function is the most valuable aspect of the solution."
"The product provides better data processing features than other tools."
"In terms of scalability, if you have enough hardware you can scale out. Scalability doesn't have any issues."
"Cloudera, as a whole, is designed to provide organizations with solutions for big data."
"The most valuable feature is the ability to create pipelines, streamline and extract data from the pipelines."
"The ability to store data in memory is a standout feature, enhanced by robust failover mechanisms."
"The product can automatically reinstall and reconfigure in case of a shutdown."
"The paramount advantage is the exceptional speed."
"It's a distributed relational database, so it does not have a single server, it has multiple servers. Its architecture itself is fast because it has multiple nodes to distribute the workload and process large amounts of data."
"MemSQL supports the MySQL protocol, and many functions are similar, so the learning curve is very short."
"The product's initial setup phase was pretty straightforward, with no complex processes."
 

Cons

"There are better solutions out there that have more features than this one."
"While the deployed product is generally functional, there are instances where it presents difficulties."
"Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment."
"The initial setup of Cloudera is difficult."
"Cloudera's support is extremely bad and cannot be relied on."
"There are multiple bugs when we update."
"The performance of some analytics engines provided by Cloudera is not that good."
"The Cloudera training has deteriorated significantly."
"The product can be developed further to provide more appropriate output to users as it is one of the areas where there are shortcomings."
"We don't get good discounts in Pakistan."
"There should be more pipelines available because I think that if MemSQL can connect to other services, that would be great."
"It is not the optimal choice for direct data collection through queries, and it's more suited for aggregation tasks."
"Poor key distribution can significantly impact performance, requiring a backward approach in design rather than adding tables incrementally."
"Having the ability to migrate servers using a single command would be extremely beneficial."
"For new customers, it's very tough to start. Their documentation isn't organized, and there's no online training available. SingleStore is working on it, but that's a major drawback."
 

Pricing and Cost Advice

"It is an expensive product."
"Cloudera Distribution for Hadoop is expensive, with support costs involved."
"The tool is expensive...For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"The tool is not expensive."
"I haven't bought a license for this solution. I'm only using the Apache license version."
"The price could be better for the product."
"The solution is fairly expensive."
"The product’s price depends from project to project."
"The price of the product is okay compared to the other available solutions in the market. SingleStore is a reasonably priced product, considering the functions it offers."
"I would advise users to try the free 128GB version."
"The product's licensing is not expensive. It is comparable."
"SingleStore is a bit expensive."
"Using it for analytical purposes can be cost-effective in the long run, especially in terms of infrastructure."
"They have two main options: cloud installation and bare-metal installation, each with different pricing models."
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
831,265 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
23%
Computer Software Company
14%
Educational Organization
11%
Manufacturing Company
9%
Financial Services Firm
31%
Computer Software Company
13%
Manufacturing Company
6%
Healthcare Company
4%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its...
What needs improvement with Cloudera Distribution for Hadoop?
The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. ...
What do you like most about SingleStore DB?
The paramount advantage is the exceptional speed.
What is your experience regarding pricing and costs for SingleStore DB?
Using it for analytical purposes can be cost-effective in the long run, especially in terms of infrastructure. While building an on-premise cluster incurs an initial cost for servers with ample RAM...
What needs improvement with SingleStore DB?
There's a noteworthy consideration when it comes to collecting massive amounts of data. It is not the optimal choice for direct data collection through queries, and it's more suited for aggregation...
 

Learn More

Video not available
 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
6Sense, ADNOC, Adobe, Akamai, CARFAX, Cigna, Cisco, Comcast, DBS Bank, Dell, Dentsu, EY, FirstEnergy, GE, Goldman Sachs, Heap, Hulu, IMAX, Kakao, Kroger, LG, LiveRamp, Lumana, NBC, OpenDialog, Outreach, Palo Alto Networks, PicPay, RBC, Samsung, Siemens, SiriusXM, SK Telecom, SKAI, Sony, State Street Financial, STC, SunRun, TATA, Thorn, and ZoomInfo.
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: January 2025.
831,265 professionals have used our research since 2012.