Try our new research platform with insights from 80,000+ expert users

Cloudera Distribution for Hadoop vs SingleStore comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Distribution for H...
Average Rating
8.0
Reviews Sentiment
6.4
Number of Reviews
50
Ranking in other categories
Hadoop (2nd), NoSQL Databases (8th)
SingleStore
Average Rating
8.8
Reviews Sentiment
7.1
Number of Reviews
7
Ranking in other categories
Database as a Service (DBaaS) (7th), Vector Databases (13th)
 

Mindshare comparison

While both are Databases solutions, they serve different purposes. Cloudera Distribution for Hadoop is designed for Hadoop and holds a mindshare of 25.7%, up 22.7% compared to last year.
SingleStore, on the other hand, focuses on Database as a Service (DBaaS), holds 1.6% mindshare, up 0.8% since last year.
Hadoop
Database as a Service (DBaaS)
 

Featured Reviews

Rok Dolinsek - PeerSpot reviewer
Enables on-premise implementation with powerful data processing capabilities
This is the only solution that is possible to install on-premise. Cloudera provides a hybrid solution that combines compute on cloud or on-premises. It includes all machine learning algorithms in the Spark machine learning library. All functionalities needed for a big data platform and ETL are on the platform, eliminating the need for other tools. It is scalable, ready for vertical scaling, and very powerful, offering numerous functionalities and configurations for generative AI.
Hitesh Kunchakuri - PeerSpot reviewer
A reasonably priced product that offers good speed and seamless support
Currently, I can't think of any areas that require improvement because SingleStore was recently launched in the market. The product can be developed further to provide more appropriate output to users as it is one of the areas where there are shortcomings. The current SingleStore model provides output based on the RANK function. If a user searches for a liquor bottle, then with all the data the product has, it will search for the liquor bottle in the data, and based on a match, the product has an algorithm to rank the product because of which the paragraph that has the best match will be ranked as a 100, the next one as 99, following which the next product will be ranked as 98 and so on. The output from the solution will fetch you all the 100 products that are available in a store, but sometimes a user might require a product with a 97 or 98 percent match from the DB, meaning the product doesn't always work to provide a 100 percent match, an area I feel that can be optimized in the product. Currently, SingleStore's features are excellent as it can read documents, images, and everything. The product works seamlessly for me.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"We also really like the Cloudera community. You can have any question and will have your answer within a few hours."
"It is helpful to gather and process data."
"We had a data warehouse before all the data. We can process a lot more data structures."
"We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization."
"The solution is reliable and stable, it fits our requirements."
"Cloudera is a very manageable solution with good support."
"The features I find most valuable is that the solution is that it is easy to install and to work with. It starts with the installation and from there on the management is very simple and centralized."
"Provides a viable open-source solution for enterprise implementations and reliable, intelligent data analysis."
"The most valuable feature is the ability to create pipelines, streamline and extract data from the pipelines."
"The paramount advantage is the exceptional speed."
"The product's initial setup phase was pretty straightforward, with no complex processes."
"It's a distributed relational database, so it does not have a single server, it has multiple servers. Its architecture itself is fast because it has multiple nodes to distribute the workload and process large amounts of data."
"The product can automatically reinstall and reconfigure in case of a shutdown."
"The ability to store data in memory is a standout feature, enhanced by robust failover mechanisms."
"MemSQL supports the MySQL protocol, and many functions are similar, so the learning curve is very short."
 

Cons

"It would be useful if Cloudera had more tools like SQL Engines that offer the traditional relational database. We have to do a lot of work preparing the data outside Cloudera before getting it into the platform."
"The price of this solution could be lowered."
"The security of this solution could be improved. There should also be a way to basically have a blockchain enabled storage with the HDFS."
"It could be faster and more user-friendly."
"Currently, we are using many other tools such as Spark and Blade Job to improve the performance."
"Cloudera Distribution for Hadoop has a limited feature list and a lot of costs involved."
"The dashboard could be improved."
"The solution does not support multiple languages very well and this means users need to create work-arounds to implement some solutions."
"Poor key distribution can significantly impact performance, requiring a backward approach in design rather than adding tables incrementally."
"There should be more pipelines available because I think that if MemSQL can connect to other services, that would be great."
"It is not the optimal choice for direct data collection through queries, and it's more suited for aggregation tasks."
"We don't get good discounts in Pakistan."
"For new customers, it's very tough to start. Their documentation isn't organized, and there's no online training available. SingleStore is working on it, but that's a major drawback."
"The product can be developed further to provide more appropriate output to users as it is one of the areas where there are shortcomings."
"Having the ability to migrate servers using a single command would be extremely beneficial."
 

Pricing and Cost Advice

"When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
"The price is very high. The solution is expensive."
"The solution is fairly expensive."
"The product’s price depends from project to project."
"I wouldn't recommend CDH to others because of its high cost."
"I haven't bought a license for this solution. I'm only using the Apache license version."
"I believe we pay for a three-year license."
"The price could be better for the product."
"I would advise users to try the free 128GB version."
"The product's licensing is not expensive. It is comparable."
"Using it for analytical purposes can be cost-effective in the long run, especially in terms of infrastructure."
"SingleStore is a bit expensive."
"They have two main options: cloud installation and bare-metal installation, each with different pricing models."
"The price of the product is okay compared to the other available solutions in the market. SingleStore is a reasonably priced product, considering the functions it offers."
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
838,713 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
24%
Computer Software Company
14%
Educational Organization
12%
Manufacturing Company
9%
Financial Services Firm
33%
Computer Software Company
13%
Healthcare Company
5%
Manufacturing Company
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The price for Cloudera is average, yet it is very good compared to other solutions. It can be deployed on-premises, unlike competitors' cloud-only solutions.
What needs improvement with Cloudera Distribution for Hadoop?
It is quite complicated to configure and install. Integrating the platform into an information system is always a challenge, especially when starting with on-premise implementation. Integrating wit...
What do you like most about SingleStore DB?
The paramount advantage is the exceptional speed.
What is your experience regarding pricing and costs for SingleStore DB?
Using it for analytical purposes can be cost-effective in the long run, especially in terms of infrastructure. While building an on-premise cluster incurs an initial cost for servers with ample RAM...
What needs improvement with SingleStore DB?
There's a noteworthy consideration when it comes to collecting massive amounts of data. It is not the optimal choice for direct data collection through queries, and it's more suited for aggregation...
 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
6Sense, ADNOC, Adobe, Akamai, CARFAX, Cigna, Cisco, Comcast, DBS Bank, Dell, Dentsu, EY, FirstEnergy, GE, Goldman Sachs, Heap, Hulu, IMAX, Kakao, Kroger, LG, LiveRamp, Lumana, NBC, OpenDialog, Outreach, Palo Alto Networks, PicPay, RBC, Samsung, Siemens, SiriusXM, SK Telecom, SKAI, Sony, State Street Financial, STC, SunRun, TATA, Thorn, and ZoomInfo.
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: February 2025.
838,713 professionals have used our research since 2012.