Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: December 2024.
AI & Data Engineering Lead at a tech services company with 10,001+ employees
Real User
2022-05-20T12:33:27Z
May 20, 2022
The most valuable feature is that I can use CDH for almost all use cases across all industries, including the financial sector, public sector, private retailers, and so on.
Vice President at a financial services firm with 10,001+ employees
Real User
2022-04-29T11:53:00Z
Apr 29, 2022
We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization.
CDH has a wide variety of proprietary tools that we use, like Impala. So from that perspective, it's quite useful as opposed to something open-source. We get a lot of value from Cloudera's proprietary tools.
DBA team manager at a financial services firm with 1,001-5,000 employees
Real User
2019-07-16T05:40:00Z
Jul 16, 2019
The features I find most valuable is that the solution is that it is easy to install and to work with. It starts with the installation and from there on the management is very simple and centralized.
Senior Consultant & Training at a tech services company with 51-200 employees
Consultant
2019-07-16T05:40:00Z
Jul 16, 2019
We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that.
Cloudera Distribution for Hadoop is the world's most complete, tested, and popular distribution of Apache Hadoop and related projects. CDH is 100% Apache-licensed open source and is the only Hadoop solution to offer unified batch processing, interactive SQL, and interactive search, and role-based access controls. More enterprises have downloaded CDH than all other such distributions combined.
The tool can be deployed using different container technologies, which makes it very scalable.
The product is completely secure.
We had a data warehouse before all the data. We can process a lot more data structures.
The data science aspect of the solution is valuable.
Customer service and support were able to fix whatever the issue was.
The product provides better data processing features than other tools.
The scalability of Cloudera Distribution for Hadoop is excellent.
The solution's most valuable feature is the enterprise data platform.
The solution is stable.
Very good end-to-end security features.
The most valuable feature is that I can use CDH for almost all use cases across all industries, including the financial sector, public sector, private retailers, and so on.
We're now able to store large volumes of data through Cloudera Distribution for Hadoop. We're able to push large volumes of data to the platform, and that used to be a challenge, especially when storing a terabyte of information. This is the area where Cloudera Distribution for Hadoop improved the organization.
With a cluster available, you can manage the security layer using the shared SDX - it provides flexibility.
The solution is reliable and stable, it fits our requirements.
The file system is a valuable feature.
CDH has a wide variety of proprietary tools that we use, like Impala. So from that perspective, it's quite useful as opposed to something open-source. We get a lot of value from Cloudera's proprietary tools.
I don't see any performance issues.
The product as a whole is good.
The main advantage is the storage is less expensive.
The most valuable feature is Kubernetes.
We also really like the Cloudera community. You can have any question and will have your answer within a few hours.
The most valuable feature is Impala, the querying engine, which is very fast.
The features I find most valuable is that the solution is that it is easy to install and to work with. It starts with the installation and from there on the management is very simple and centralized.
We experienced many issues when we started working with Hadoop 3.0 in the Cloudera 6.0 version, so there are a lot of things that need to improve. I believe they are working on that.
In terms of scalability, if you have enough hardware you can scale out. Scalability doesn't have any issues.
The search function is the most valuable aspect of the solution.
Provides a viable open-source solution for enterprise implementations and reliable, intelligent data analysis.