
Cloudera Distribution for Hadoop vs QueryIO comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

Cloudera Distribution for H...
Ranking in Hadoop: 2nd
Average Rating: 8.0
Reviews Sentiment: 6.4
Number of Reviews: 49
Ranking in other categories: NoSQL Databases (8th)

QueryIO
Ranking in Hadoop: 16th
Average Rating: 8.0
Number of Reviews: 1
Ranking in other categories: No ranking in other categories
 

Mindshare comparison

As of December 2024, Cloudera Distribution for Hadoop holds a 28.2% mindshare in the Hadoop category, up from 23.1% a year earlier. QueryIO holds 0.6%, up from 0.5% a year earlier. Mindshare is calculated from PeerSpot user engagement data.
 

Featured Reviews

Shahan Rehman - PeerSpot reviewer
Can host multiple technologies and help businesses with their AI initiatives
How easy or difficult the product is to set up depends on the customer environment where it is deployed. Whether the firm is in banking, industry, or retail, the deployment can range from simple to very complex depending on the architecture: how large a database is maintained and which applications are to be hosted. Cloudera Distribution for Hadoop goes through the usual deployment process, like any software product. You need separate environments before going into production: dev and test environments and a pre-production environment. You install and configure all the components in the test environment and then test them in the pre-production environment. Once UAT is done, you move them to the production environment. In general, it's a critical product deployed in a company.
MR
Stable with good connectivity and good integration capabilities
Data cleansing is not intuitive and user-friendly. When things have errors, you have to hunt them down as opposed to the solution simply showing you intuitively where to find it. I would recommend that they look at that Tableau Prep tool and see how it is pieced together. That's a great data cleansing tool. If Microsoft has something like that, then we wouldn't even have to look at some of the other options. There needs to be some simplification of the user interface. Right now it's too complicated. There isn't a way to put controls on the solution, so anyone can use any part of it, and sometimes novices will go and try to create things, but not know enough about what is official and what is published. It would be ideal if we could segment off certain sections so that not everyone had access to the whole solution. I'd like to see something more of a mapping tool so that you could see how the reports are connected, similar to Tableau Prep and Naim. That would make for a pretty useful diagnostics check. People would be better able to understand the linkage between your datasets. It would be nice if the solution offered some templates. It would make it even more plug and play, and give people a good jumping-off point. After that, they could explore other bells and whistles as they get further into understanding the solution. The solution should work in some virtualization. It would be a good added feature. If this product had those things then I wouldn't need to use other products.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature is that I can use CDH for almost all use cases across all industries, including the financial sector, public sector, private retailers, and so on."
"We also really like the Cloudera community. You can have any question and will have your answer within a few hours."
"It is helpful to gather and process data."
"It has the best proxy, security, and support features compared to open-source products."
"CDH has a wide variety of proprietary tools that we use, like Impala. So from that perspective, it's quite useful as opposed to something open-source. We get a lot of value from Cloudera's proprietary tools."
"The product is completely secure."
"The tool can be deployed using different container technologies, which makes it very scalable."
"We had a data warehouse before all the data. We can process a lot more data structures."
"Anyone who has even a little bit of knowledge of the solution can begin to create things. You don't have to be technical to use the solution."
 

Cons

"The price of this solution could be lowered."
"The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. The Cloudera Machine Learning aspect could be tuned and enhanced to enable us to host some predictive analytics machine learning and AI use cases."
"It would be useful if Cloudera had more tools like SQL Engines that offer the traditional relational database. We have to do a lot of work preparing the data outside Cloudera before getting it into the platform."
"The governance aspect of the solution should be improved."
"The tool's ability to be deployed on a cloud model is an area of concern where improvements are required."
"The Cloudera training has deteriorated significantly."
"Cloudera's support is extremely bad and cannot be relied on."
"The areas of improvement depend on the scale of the project. For banking customers, security features and an essential budget for commercial licenses would be the top priority. Data regulation could be the most crucial for a project with extensive data or an extra use case."
"There needs to be some simplification of the user interface."
 

Pricing and Cost Advice

"I wouldn't recommend CDH to others because of its high cost."
"It is an expensive product."
"The solution is expensive."
"Cloudera Distribution for Hadoop is expensive, with support costs involved."
"The product’s price varies from project to project."
"The price could be better for the product."
"I haven't bought a license for this solution. I'm only using the Apache license version."
"Cloudera requires a license to use."
Information not available
824,067 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 23%
Computer Software Company: 15%
Educational Organization: 11%
Manufacturing Company: 8%
No data available
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its...
What needs improvement with Cloudera Distribution for Hadoop?
The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. ...
 

 

Overview

 

Sample Customers

37signals, Adconion, adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Information Not Available
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: December 2024.