Try our new research platform with insights from 80,000+ expert users

Cloudera Distribution for Hadoop vs SingleStore comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Cloudera Distribution for H...
Average Rating
8.0
Number of Reviews
49
Ranking in other categories
Hadoop (2nd), NoSQL Databases (7th)
SingleStore
Average Rating
8.8
Number of Reviews
7
Ranking in other categories
Database as a Service (DBaaS) (7th), Vector Databases (12th)
 

Mindshare comparison

While both are Databases solutions, they serve different purposes. Cloudera Distribution for Hadoop is designed for Hadoop and holds a mindshare of 27.1%, up 22.7% compared to last year.
SingleStore, on the other hand, focuses on Database as a Service (DBaaS), holds 1.2% mindshare, up 0.8% since last year.
Hadoop
Database as a Service (DBaaS)
 

Featured Reviews

Shahan Rehman - PeerSpot reviewer
Mar 21, 2024
Can host multiple technologies and help businesses with their AI initiatives
The ease or difficulty in setting up the product depends on the environment of the customer where the tool is deployed. If a banking, industrial, or retail sector firm is taken into concentration, depending on how big of a database is maintained, including the applications that are to be hosted, the deployment process can range from a simple to a very complex phase, depending on the architecture. For Cloudera Distribution for Hadoop, one has to go through the usual deployment process, like for any software product. You have to have different environments before going into production, like pre-production environments, test and dev environments. You install and configure all the components in the test environment and then test them on the pre-production environment. Once UAT is done, you move them to the production environment. In general, it's a critical product deployed in a company.
Hitesh Kunchakuri - PeerSpot reviewer
Dec 18, 2023
A reasonably priced product that offers good speed and seamless support
Currently, I can't think of any areas that require improvement because SingleStore was recently launched in the market. The product can be developed further to provide more appropriate output to users as it is one of the areas where there are shortcomings. The current SingleStore model provides output based on the RANK function. If a user searches for a liquor bottle, then with all the data the product has, it will search for the liquor bottle in the data, and based on a match, the product has an algorithm to rank the product because of which the paragraph that has the best match will be ranked as a 100, the next one as 99, following which the next product will be ranked as 98 and so on. The output from the solution will fetch you all the 100 products that are available in a store, but sometimes a user might require a product with a 97 or 98 percent match from the DB, meaning the product doesn't always work to provide a 100 percent match, an area I feel that can be optimized in the product. Currently, SingleStore's features are excellent as it can read documents, images, and everything. The product works seamlessly for me.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature is Kubernetes."
"The file system is a valuable feature."
"The product is completely secure."
"Provides a viable open-source solution for enterprise implementations and reliable, intelligent data analysis."
"The product provides better data processing features than other tools."
"The solution's most valuable feature is the enterprise data platform."
"The tool's most interesting features are the distributed file system and unstructured data processing capability. Because we have a lot of unstructured data, like XML and social media logs, these features make it more valuable than the usual data warehousing solutions."
"The main advantage is the storage is less expensive."
"MemSQL supports the MySQL protocol, and many functions are similar, so the learning curve is very short."
"The product's initial setup phase was pretty straightforward, with no complex processes."
"It's a distributed relational database, so it does not have a single server, it has multiple servers. Its architecture itself is fast because it has multiple nodes to distribute the workload and process large amounts of data."
"The paramount advantage is the exceptional speed."
"The most valuable feature is the ability to create pipelines, streamline and extract data from the pipelines."
"The product can automatically reinstall and reconfigure in case of a shutdown."
"The ability to store data in memory is a standout feature, enhanced by robust failover mechanisms."
 

Cons

"The pricing needs to improve."
"The tool's ability to be deployed on a cloud model is an area of concern where improvements are required."
"There are multiple bugs when we update."
"The initial setup of Cloudera is difficult."
"The performance of some analytics engines provided by Cloudera is not that good."
"The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. The Cloudera Machine Learning aspect could be tuned and enhanced to enable us to host some predictive analytics machine learning and AI use cases."
"The Cloudera training has deteriorated significantly."
"This is a very expensive solution."
"The product can be developed further to provide more appropriate output to users as it is one of the areas where there are shortcomings."
"We don't get good discounts in Pakistan."
"There should be more pipelines available because I think that if MemSQL can connect to other services, that would be great."
"Poor key distribution can significantly impact performance, requiring a backward approach in design rather than adding tables incrementally."
"It is not the optimal choice for direct data collection through queries, and it's more suited for aggregation tasks."
"Having the ability to migrate servers using a single command would be extremely beneficial."
"For new customers, it's very tough to start. Their documentation isn't organized, and there's no online training available. SingleStore is working on it, but that's a major drawback."
 

Pricing and Cost Advice

"Cloudera Distribution for Hadoop is expensive, with support costs involved."
"I wouldn't recommend CDH to others because of its high cost."
"The solution is expensive."
"The tool is expensive...For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"The pricing must be improved."
"The price could be better for the product."
"When comparing with Oracle Sybase and SQL, it's cheaper. It's not expensive."
"I believe we pay for a three-year license."
"The price of the product is okay compared to the other available solutions in the market. SingleStore is a reasonably priced product, considering the functions it offers."
"The product's licensing is not expensive. It is comparable."
"SingleStore is a bit expensive."
"They have two main options: cloud installation and bare-metal installation, each with different pricing models."
"I would advise users to try the free 128GB version."
"Using it for analytical purposes can be cost-effective in the long run, especially in terms of infrastructure."
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
814,649 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
23%
Computer Software Company
15%
Educational Organization
10%
Manufacturing Company
8%
Financial Services Firm
30%
Computer Software Company
14%
Manufacturing Company
6%
Retailer
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its...
What needs improvement with Cloudera Distribution for Hadoop?
The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. ...
What do you like most about SingleStore DB?
The paramount advantage is the exceptional speed.
What is your experience regarding pricing and costs for SingleStore DB?
Using it for analytical purposes can be cost-effective in the long run, especially in terms of infrastructure. While building an on-premise cluster incurs an initial cost for servers with ample RAM...
What needs improvement with SingleStore DB?
There's a noteworthy consideration when it comes to collecting massive amounts of data. It is not the optimal choice for direct data collection through queries, and it's more suited for aggregation...
 

Learn More

Video not available
 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
6Sense, ADNOC, Adobe, Akamai, CARFAX, Cigna, Cisco, Comcast, DBS Bank, Dell, Dentsu, EY, FirstEnergy, GE, Goldman Sachs, Heap, Hulu, IMAX, Kakao, Kroger, LG, LiveRamp, Lumana, NBC, OpenDialog, Outreach, Palo Alto Networks, PicPay, RBC, Samsung, Siemens, SiriusXM, SK Telecom, SKAI, Sony, State Street Financial, STC, SunRun, TATA, Thorn, and ZoomInfo.
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: October 2024.
814,649 professionals have used our research since 2012.