Cloudera Distribution for Hadoop vs QueryIO comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Cloudera Distribution for H...
Ranking in Hadoop: 2nd
Average Rating: 8.0
Reviews Sentiment: 6.4
Number of Reviews: 49
Ranking in other categories: NoSQL Databases (8th)

QueryIO
Ranking in Hadoop: 16th
Average Rating: 8.0
Number of Reviews: 1
Ranking in other categories: None
 

Mindshare comparison

As of January 2025, Cloudera Distribution for Hadoop holds a 27.8% mindshare in the Hadoop category, up from 23.5% a year earlier. QueryIO's mindshare is 0.5%, down from 0.6% a year earlier. Mindshare is calculated from PeerSpot user engagement data.

Featured Reviews

Miodrag-Stanic - PeerSpot reviewer
You can manage all services from one place in an integrated manner
We switched to Airflow because Cloudera is outdated. It's not widely used. It would be good if we had Spark 3.5; the Spark version we have is quite old. Cloudera is now offering an alternate solution as a replacement for AWS. AWS works badly with small files. The solution is not fit for on-premise distributions. It should be containerized so we can deploy it as containers within Kubernetes. We had one upgrade from CDH to CDP, which took a long time. I would expect that, with a containerized deployment, upgrades would go much more quickly than what we experienced.
MR
Stable with good connectivity and good integration capabilities
Data cleansing is not intuitive or user-friendly. When there are errors, you have to hunt them down instead of the solution simply showing you where to find them. I would recommend that they look at the Tableau Prep tool and see how it is pieced together. That's a great data cleansing tool. If Microsoft had something like that, we wouldn't even have to look at some of the other options. There needs to be some simplification of the user interface. Right now it's too complicated. There isn't a way to put controls on the solution, so anyone can use any part of it, and sometimes novices will try to create things without knowing enough about what is official and what is published. It would be ideal if we could segment off certain sections so that not everyone had access to the whole solution. I'd like to see more of a mapping tool so that you could see how the reports are connected, similar to Tableau Prep and Naim. That would make for a pretty useful diagnostics check. People would be better able to understand the linkage between their datasets. It would be nice if the solution offered some templates. It would make it even more plug and play, and give people a good jumping-off point. After that, they could explore other bells and whistles as they get further into understanding the solution. The solution should work in some virtualization. It would be a good added feature. If this product had those things, then I wouldn't need to use other products.

Quotes from Members


Pros

"I don't see any performance issues."
"The main advantage is the storage is less expensive."
"The file system is a valuable feature."
"Cloudera is a very manageable solution with good support."
"The most valuable feature is Impala, the querying engine, which is very fast."
"The solution is reliable and stable, it fits our requirements."
"The solution's most valuable feature is the enterprise data platform."
"Very good end-to-end security features."
"Anyone who has even a little bit of knowledge of the solution can begin to create things. You don't have to be technical to use the solution."
 

Cons

"It would be useful if Cloudera had more tools like SQL Engines that offer the traditional relational database. We have to do a lot of work preparing the data outside Cloudera before getting it into the platform."
"There are better solutions out there that have more features than this one."
"The procedure for operations could be simplified."
"The governance aspect of the solution should be improved."
"Cloudera Distribution for Hadoop has a limited feature list and a lot of costs involved."
"The Cloudera training has deteriorated significantly."
"While the deployed product is generally functional, there are instances where it presents difficulties."
"The initial setup of Cloudera is difficult."
"There needs to be some simplification of the user interface."
 

Pricing and Cost Advice

"I haven't bought a license for this solution. I'm only using the Apache license version."
"The solution is fairly expensive."
"I wouldn't recommend CDH to others because of its high cost."
"The tool is not expensive."
"The price could be better for the product."
"I believe we pay for a three-year license."
"It is an expensive product."
"The product’s price varies from project to project."
Information not available
831,265 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 23%
Computer Software Company: 14%
Educational Organization: 11%
Manufacturing Company: 9%
No data available
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its...
What needs improvement with Cloudera Distribution for Hadoop?
The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. ...
 

Sample Customers

37signals, Adconion, adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Information Not Available
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: January 2025.