Cloudera Distribution for Hadoop and Cloudera Data Platform are enterprise data management and analytics products. Cloudera Data Platform has the upper hand with its advanced features and integration capabilities, offering a comprehensive and scalable solution for modern data handling.
Features: Cloudera Distribution for Hadoop provides robust data processing capabilities, focusing on batch processing and offering strong community code support. Cloudera Data Platform includes advanced machine learning, data analytics tools, and seamless cloud service integration, supporting both on-premise and cloud environments with a modernized architecture.
Room for Improvement: Cloudera Distribution for Hadoop could enhance its cloud integration and reduce complexity in traditional deployment practices. It could also modernize its interface and expand machine learning capabilities. Cloudera Data Platform might improve affordability for smaller enterprises, simplify initial setup, and streamline updates to reduce potential downtime during upgrades.
Ease of Deployment and Customer Service: Cloudera Data Platform provides a cloud-native deployment model, ensuring flexibility and minimal downtime while offering proactive customer service. Cloudera Distribution for Hadoop relies on traditional deployments, requiring more setup intricacies, yet benefits those familiar with Hadoop through comprehensive documentation.
Pricing and ROI: Cloudera Distribution for Hadoop is economical for existing infrastructure where heavy customization isn't necessary, offering stable and reliable performance. Cloudera Data Platform, on the other hand, has a premium cost due to its comprehensive features and integration, promising superior long-term ROI through enhanced data insights and operational efficiencies.
The technical support is quite good and better than IBM.
Integrating with Active Directory, managing security, and configuration are the main concerns.
It can be deployed on-premises, unlike competitors' cloud-only solutions.
This is the only solution that is possible to install on-premise.
Cloudera Data Platform offers a powerful fusion of Hadoop technology and user-centric tools, enabling seamless scalability and open-source flexibility. It supports large-scale data operations with tools like Ranger and Cloudera Data Science Workbench, offering efficient cluster management and containerization capabilities.
Designed to support extensive data needs, Cloudera Data Platform encompasses a comprehensive Hadoop stack, which includes HDFS, Hive, and Spark. Its integration with Ambari provides user-friendliness in management and configuration. Despite its strengths in scalability and security, Cloudera Data Platform requires enhancements in multi-tenant implementation, governance, and UI, while attribute-level encryption and better HDFS namenode support are also needed. Stability, especially regarding the Hue UI, financial costs, and disaster recovery are notable challenges. Additionally, integration with cloud storage and deployment methods could be more intuitive to enhance user experience, along with more effective support and community engagement.
What are the key features?Cloudera Data Platform is implemented extensively across industries like hospitality for data science activities, including managing historical data. Its adaptability extends to operational analytics for sectors like oil & gas, finance, and healthcare, often enhanced by Hortonworks Data Platform for data ingestion and analytics tasks.
We monitor all Data Management Platforms (DMP) reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.