Try our new research platform with insights from 80,000+ expert users
Cloudera Distribution for Hadoop Logo

Cloudera Distribution for Hadoop pros and cons

Vendor: Cloudera
4.0 out of 5
Badge Leader
1,476 followers
Post review

Pros & Cons summary

Buyer's Guide

Get pricing advice, tips, use cases and valuable features from real users of this product.
Get the report

Prominent pros & cons

PROS

Cloudera Distribution for Hadoop offers scalability if there is sufficient hardware, allowing seamless scaling out.
Impala, the querying engine, stands out for its speed and efficiency in data querying processes.
It efficiently manages large volumes of data, addressing challenges associated with storing substantial data, such as terabytes of information.
The tool excels in data security, offering superior security features compared to open-source alternatives.
Cloudera provides a flexible hybrid solution that integrates compute capabilities both on cloud and on-premises.

CONS

Cloudera Distribution for Hadoop requires improvements in support, as it is not reliable and has been a concern since it was relatively new.
There is a need to enhance the stability of Cloudera Distribution for Hadoop, as it is not always stable, especially for large-scale data solutions.
The price of Cloudera Distribution for Hadoop is high, making it an expensive option, and there is a need for better pricing strategies.
Security is a significant concern, and improvements are needed, with emphasis on blockchain-enabled storage capabilities.
Cloudera Distribution for Hadoop's deployment and integration process is quite complicated and challenging, particularly for on-premise implementations.
 

Cloudera Distribution for Hadoop Pros review quotes

LS
Nov 4, 2022
Very good end-to-end security features.
Thishen Govender - PeerSpot reviewer
Jul 14, 2019
Provides a viable open-source solution for enterprise implementations and reliable, intelligent data analysis.
Shahan Rehman - PeerSpot reviewer
Mar 21, 2024
The tool can be deployed using different container technologies, which makes it very scalable.
Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: February 2025.
832,891 professionals have used our research since 2012.
reviewer1850319 - PeerSpot reviewer
Apr 29, 2022
We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization.
Miodrag-Stanic - PeerSpot reviewer
Dec 19, 2023
We had a data warehouse before all the data. We can process a lot more data structures.
MohamedSaied - PeerSpot reviewer
Apr 10, 2024
The tool's most interesting features are the distributed file system and unstructured data processing capability. Because we have a lot of unstructured data, like XML and social media logs, these features make it more valuable than the usual data warehousing solutions.
Miodrag Milojevic - PeerSpot reviewer
Jul 21, 2023
The scalability of Cloudera Distribution for Hadoop is excellent.
it_user900987 - PeerSpot reviewer
Jul 16, 2019
In terms of scalability, if you have enough hardware you can scale out. Scalability doesn't have any issues.
it_user1296588 - PeerSpot reviewer
Mar 9, 2020
The most valuable feature is Impala, the querying engine, which is very fast.
MG
Mar 25, 2020
We also really like the Cloudera community. You can have any question and will have your answer within a few hours.
 

Cloudera Distribution for Hadoop Cons review quotes

LS
Nov 4, 2022
The Cloudera training has deteriorated significantly.
Thishen Govender - PeerSpot reviewer
Jul 14, 2019
The solution does not support multiple languages very well and this means users need to create work-arounds to implement some solutions.
Shahan Rehman - PeerSpot reviewer
Mar 21, 2024
The tool's ability to be deployed on a cloud model is an area of concern where improvements are required.
Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: February 2025.
832,891 professionals have used our research since 2012.
reviewer1850319 - PeerSpot reviewer
Apr 29, 2022
Cloudera Distribution for Hadoop has a limited feature list and a lot of costs involved.
Miodrag-Stanic - PeerSpot reviewer
Dec 19, 2023
The solution is not fit for on-premise distributions.
MohamedSaied - PeerSpot reviewer
Apr 10, 2024
The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. The Cloudera Machine Learning aspect could be tuned and enhanced to enable us to host some predictive analytics machine learning and AI use cases.
Miodrag Milojevic - PeerSpot reviewer
Jul 21, 2023
Cloudera Distribution for Hadoop is not always completely stable in some cases, which can be a concern for big data solutions.
it_user900987 - PeerSpot reviewer
Jul 16, 2019
The one thing that we struggled with predominately was support. Because it was relatively new, support was always a big issue and I think it's still a bit of an ongoing concern with the team currently managing it.
it_user1296588 - PeerSpot reviewer
Mar 9, 2020
There is a maximum of a one-gigabyte block size, which is an area of storage that can be improved upon.
MG
Mar 25, 2020
Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment.