Try our new research platform with insights from 80,000+ expert users

Cloudera Distribution for Hadoop vs QueryIO comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Cloudera Distribution for H...
Ranking in Hadoop
2nd
Average Rating
8.0
Reviews Sentiment
6.4
Number of Reviews
49
Ranking in other categories
NoSQL Databases (7th)
QueryIO
Ranking in Hadoop
16th
Average Rating
8.0
Number of Reviews
1
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of November 2024, in the Hadoop category, the mindshare of Cloudera Distribution for Hadoop is 27.1%, up from 22.7% compared to the previous year. The mindshare of QueryIO is 0.9%, up from 0.5% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Hadoop
 

Featured Reviews

Shahan Rehman - PeerSpot reviewer
Can host multiple technologies and help businesses with their AI initiatives
The ease or difficulty in setting up the product depends on the environment of the customer where the tool is deployed. If a banking, industrial, or retail sector firm is taken into concentration, depending on how big of a database is maintained, including the applications that are to be hosted, the deployment process can range from a simple to a very complex phase, depending on the architecture. For Cloudera Distribution for Hadoop, one has to go through the usual deployment process, like for any software product. You have to have different environments before going into production, like pre-production environments, test and dev environments. You install and configure all the components in the test environment and then test them on the pre-production environment. Once UAT is done, you move them to the production environment. In general, it's a critical product deployed in a company.
MR
Stable with good connectivity and good integration capabilities
Data cleansing is not intuitive and user-friendly. When things have errors, you have to hunt them down as opposed to the solution simply showing you intuitively where to find it. I would recommend that they look at that Tableau Prep tool and see how it is pieced together. That's a great data cleansing tool. If Microsoft has something like that, then we wouldn't even have to look at some of the other options. There needs to be some simplification of the user interface. Right now it's too complicated. There isn't a way to put controls on the solution, so anyone can use any part of it, and sometimes novices will go and try to create things, but not know enough about what is official and what is published. It would be ideal if we could segment off certain sections so that not everyone had access to the whole solution. I'd like to see something more of a mapping tool so that you could see how the reports are connected, similar to Tableau Prep and Naim. That would make for a pretty useful diagnostics check. People would be better able to understand the linkage between your datasets. It would be nice if the solution offered some templates. It would make it even more plug and play, and give people a good jumping-off point. After that, they could explore other bells and whistles as they get further into understanding the solution. The solution should work in some virtualization. It would be a good added feature. If this product had those things then I wouldn't need to use other products.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The product provides better data processing features than other tools."
"CDH has a wide variety of proprietary tools that we use, like Impala. So from that perspective, it's quite useful as opposed to something open-source. We get a lot of value from Cloudera's proprietary tools."
"The scalability of Cloudera Distribution for Hadoop is excellent."
"It is helpful to gather and process data."
"The data science aspect of the solution is valuable."
"The product is completely secure."
"The file system is a valuable feature."
"We had a data warehouse before all the data. We can process a lot more data structures."
"Anyone who has even a little bit of knowledge of the solution can begin to create things. You don't have to be technical to use the solution."
 

Cons

"The tool's ability to be deployed on a cloud model is an area of concern where improvements are required."
"Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment."
"The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. The Cloudera Machine Learning aspect could be tuned and enhanced to enable us to host some predictive analytics machine learning and AI use cases."
"They should focus on upgrading their technical capabilities in the market."
"The user infrastructure and user interface needs to be improved, as well as the performance. The GUI needs to be better."
"The pricing needs to improve."
"The governance aspect of the solution should be improved."
"It could be faster and more user-friendly."
"There needs to be some simplification of the user interface."
 

Pricing and Cost Advice

"Cloudera Distribution for Hadoop is expensive, with support costs involved."
"I wouldn't recommend CDH to others because of its high cost."
"The tool is not expensive."
"It is an expensive product."
"The solution is expensive."
"I believe we pay for a three-year license."
"The product’s price depends from project to project."
"The solution is fairly expensive."
Information not available
report
Use our free recommendation engine to learn which Hadoop solutions are best for your needs.
816,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
23%
Computer Software Company
15%
Educational Organization
10%
Manufacturing Company
8%
No data available
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its...
What needs improvement with Cloudera Distribution for Hadoop?
The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. ...
Ask a question
Earn 20 points
 

Learn More

Video not available
 

Overview

 

Sample Customers

37signals, Adconion,adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Information Not Available
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: November 2024.
816,406 professionals have used our research since 2012.