Try our new research platform with insights from 80,000+ expert users
Cloudera Distribution for Hadoop Logo

Cloudera Distribution for Hadoop pros and cons

Vendor: Cloudera
4.0 out of 5
Badge Leader
1,475 followers
Post review
 

Cloudera Distribution for Hadoop Pros review quotes

LS
Nov 4, 2022
Very good end-to-end security features.
Thishen Govender - PeerSpot reviewer
Jul 14, 2019
Provides a viable open-source solution for enterprise implementations and reliable, intelligent data analysis.
Shahan Rehman - PeerSpot reviewer
Mar 21, 2024
The tool can be deployed using different container technologies, which makes it very scalable.
Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: November 2024.
816,406 professionals have used our research since 2012.
reviewer1850319 - PeerSpot reviewer
Apr 29, 2022
We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization.
Miodrag-Stanic - PeerSpot reviewer
Dec 19, 2023
We had a data warehouse before all the data. We can process a lot more data structures.
MohamedSaied - PeerSpot reviewer
Apr 10, 2024
The tool's most interesting features are the distributed file system and unstructured data processing capability. Because we have a lot of unstructured data, like XML and social media logs, these features make it more valuable than the usual data warehousing solutions.
Miodrag Milojevic - PeerSpot reviewer
Jul 21, 2023
The scalability of Cloudera Distribution for Hadoop is excellent.
it_user900987 - PeerSpot reviewer
Jul 16, 2019
In terms of scalability, if you have enough hardware you can scale out. Scalability doesn't have any issues.
NK
Mar 9, 2020
The most valuable feature is Impala, the querying engine, which is very fast.
MG
Mar 25, 2020
We also really like the Cloudera community. You can have any question and will have your answer within a few hours.
 

Cloudera Distribution for Hadoop Cons review quotes

LS
Nov 4, 2022
The Cloudera training has deteriorated significantly.
Thishen Govender - PeerSpot reviewer
Jul 14, 2019
The solution does not support multiple languages very well and this means users need to create work-arounds to implement some solutions.
Shahan Rehman - PeerSpot reviewer
Mar 21, 2024
The tool's ability to be deployed on a cloud model is an area of concern where improvements are required.
Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: November 2024.
816,406 professionals have used our research since 2012.
reviewer1850319 - PeerSpot reviewer
Apr 29, 2022
Cloudera Distribution for Hadoop has a limited feature list and a lot of costs involved.
Miodrag-Stanic - PeerSpot reviewer
Dec 19, 2023
The solution is not fit for on-premise distributions.
MohamedSaied - PeerSpot reviewer
Apr 10, 2024
The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. The Cloudera Machine Learning aspect could be tuned and enhanced to enable us to host some predictive analytics machine learning and AI use cases.
Miodrag Milojevic - PeerSpot reviewer
Jul 21, 2023
Cloudera Distribution for Hadoop is not always completely stable in some cases, which can be a concern for big data solutions.
it_user900987 - PeerSpot reviewer
Jul 16, 2019
The one thing that we struggled with predominately was support. Because it was relatively new, support was always a big issue and I think it's still a bit of an ongoing concern with the team currently managing it.
NK
Mar 9, 2020
There is a maximum of a one-gigabyte block size, which is an area of storage that can be improved upon.
MG
Mar 25, 2020
Without the big data environment, we cannot store all of this data live. We have billions of records and terabytes of storage to be used. It's not an option actually for us to have a big data environment.