Cloudera Distribution for Hadoop Reviews and Pricing

Yevgen Manzhulyanov

CEO at AM-BITS LLC

Aug 8, 2023

Download

Good enterprise platform with good stability

Pros and Cons

"It has the best proxy, security, and support features compared to open-source products."

"The areas of improvement depend on the scale of the project. For banking customers, security features and an essential budget for commercial licenses would be the top priority. Data regulation could be the most crucial for a project with extensive data or an extra use case."

What is most valuable?

It is a good enterprise platform. It is easier and more stable. Additionally, it has the best proxy, security, and support features compared to open-source products.

What needs improvement?

The areas of improvement depend on the scale of the project. For banking customers, security features and an essential budget for commercial licenses would be the top priority. Data regulation could be the most crucial for a project with extensive data or an extra use case.

For how long have I used the solution?

We have been using Cloudera Distribution for Hadoop for a few years.

What do I think about the stability of the solution?

I rate the product’s stability a ten out of ten.

Buyer's Guide

Cloudera Distribution for Hadoop

April 2025

Free Report: Cloudera Distribution for Hadoop Reviews and More

Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: April 2025.

DOWNLOAD NOW

845,960 professionals have used our research since 2012.

What do I think about the scalability of the solution?

We have ten customers using the product. They include data engineers, performance engineers, and environment engineers.

I rate its stability a ten out of ten.

How are customer service and support?

The product has a support subscription for one year. We use technical support only for complex use cases. We work with their team as we have direct and quick access to contact them. It helps us better understand the technical and business-related queries of the customers.

How was the initial setup?

The on-cloud version is easy to set up. Although, it is complicated to process a large amount of data for on-premises or hybrid setup. It is not a ready-to-use solution for telecom or finance technology. It requires the deployment of robust technology relying on network infrastructure.

What's my experience with pricing, setup cost, and licensing?

The product’s price depends from project to project. It is more expensive than open-source solutions and could be cheaper. However, in some cases, it is less costly than open-source.

What other advice do I have?

It is the best solution in the world at the moment. I advise others to go for it if you have an enterprise customer. I rate it a ten out of ten.

Disclosure: My company has a business relationship with this vendor other than being a customer: Partner

Hamid M. Hamid

Data architect at Banking Sector

Dec 15, 2022

Download

A scalable solution with a straightforward setup, but the price is too high

Pros and Cons

"The solution is stable."

"The pricing needs to improve."

What is our primary use case?

We used this solution as a data platform.

What needs improvement?

The pricing needs to improve. If the price was affordable, then we might have continued using Cloudera. We switched to HPE because of the cost.

For how long have I used the solution?

I used this solution for the last year.

What do I think about the stability of the solution?

The solution is stable.

What do I think about the scalability of the solution?

It is a scalable solution. There were about 20 users of this solution in my company. 10 people were required for the deployment and maintenance of the solution, including developers.

How was the initial setup?

The initial setup is straightforward.

What's my experience with pricing, setup cost, and licensing?

The price is very high. The solution is expensive.

What other advice do I have?

I would recommend this solution to others.

I rate this solution as an eight out of ten.

Which deployment model are you using for this solution?

On-premises

Disclosure: I am a real user, and this review is based on my own experience and opinions.

Buyer's Guide

Cloudera Distribution for Hadoop

April 2025

Free Report: Cloudera Distribution for Hadoop Reviews and More

Learn what your peers think about Cloudera Distribution for Hadoop. Get advice and tips from experienced pros sharing their opinions. Updated: April 2025.

DOWNLOAD NOW

845,960 professionals have used our research since 2012.

EricLin

Chairman at Athemaster co.,ltd.

Nov 1, 2023

Download

Provides excellent data processing features and enables users to connect with other applications

Pros and Cons

"The product provides better data processing features than other tools."

"The dashboard could be improved."

What is our primary use case?

I use the solution because my data is too big. It is almost 100 TB.

What is most valuable?

The product provides many APIs to connect with other applications. The product provides better data processing features than other tools.

What needs improvement?

The dashboard could be improved.

For how long have I used the solution?

I have been using the solution for seven years.

What do I think about the stability of the solution?

The tool is stable. I rate the stability an eight out of ten.

What do I think about the scalability of the solution?

The tool is scalable. I rate the scalability an eight out of ten. It is easy to scale the product. Almost 20 to 25 people use the tool in our organization. We maintain the solution ourselves. We have nine engineers in our maintenance team.

How are customer service and support?

The support is very, very helpful.

How would you rate customer service and support?

Positive

Which solution did I use previously and why did I switch?

I have worked with Oracle. Oracle is too expensive.

How was the initial setup?

It was pretty easy to install the product. It took us 20 minutes.

What's my experience with pricing, setup cost, and licensing?

The product’s cost is higher compared to other tools. The pricing must be improved.

What other advice do I have?

I recommend the solution to others. Overall, I rate the solution an eight out of ten.

Which deployment model are you using for this solution?

On-premises

Disclosure: I am a real user, and this review is based on my own experience and opinions.

Suresh_Srinivasan

Co-Founder at FORMCEPT Technologies

Jan 13, 2022

Download

Has a useful file system and is scalable

Pros and Cons

"The file system is a valuable feature."

"The security of this solution could be improved. There should also be a way to basically have a blockchain enabled storage with the HDFS."

What is our primary use case?

We use Cloudera Distribution for file storage.

This solution is deployed on-premise.

What is most valuable?

The file system is a valuable feature.

What needs improvement?

The security of this solution could be improved. There should also be a way to basically have a blockchain enabled storage with the HDFS.

For how long have I used the solution?

I have been working with Cloudera Distribution for Hadoop for 11 years.

What do I think about the stability of the solution?

This solution is stable.

What do I think about the scalability of the solution?

This solution is scalable enough for us.

We have created a product, using HDFS, and when our engineers install it for themselves or for customers, we use this solution. There are about 15 to 20 people using it at any point of time.

How was the initial setup?

The installation is straightforward. We use command-line-based installation and we have created our own way of installing with our product.

Depending on the customer or depending on internal usage, our DevOps engineer will install it or my development team will install it.

What about the implementation team?

We are very well-versed on these tools, so we implemented it ourselves.

What's my experience with pricing, setup cost, and licensing?

I haven't bought a license for this solution. I'm only using the Apache license version.

What other advice do I have?

I rate this solution an eight out of ten. Cloudera is a great product and, overall, there are many features.

We actually use Cloudera HDFS underneath, and we build our product on top of it. So, we don't use the Cloudera versions of all the other products, we just use the Cloudera HDFS, nothing else.

Which deployment model are you using for this solution?

On-premises

Disclosure: I am a real user, and this review is based on my own experience and opinions.

it_user347172

System Engineer at a tech company with 10,001+ employees

Nov 28, 2015

Download

For the clusters using CM, we are able to more tightly control and manage the configuration of all nodes in the clusters. But, it has HBase 1.0 stability issues and processing speed needs improvement.

What is most valuable?

Cluster rolling restarts
Cluster wide configuration management

How has it helped my organization?

For the clusters using CM, we are able to more tightly control and manage the configuration of all nodes in the clusters.

We are currently running six production clusters totaling 900+ nodes, and are building three more clusters. Knowing that if someone has some custom configuration on a node that they haven’t communicated out, and that I can ignore that configuration and bring that node into line with where we’ve decided to run the cluster, is very beneficial.

What needs improvement?

HBase 1.0 stability issues and processing speed is a major area for improvement. Right now, our Cloudera 5 clusters run four to seven times slower than our Cloudera 4 clusters using our storm and kafka topologies, which causes real-time processing to be a major challenge.

CM’s API is very limited and difficult when used on multiple clusters in the same CM instance

For how long have I used the solution?

We've used it for approximately two years. We also use Cloudera Manager, which is 6/10.

What was my experience with deployment of the solution?

No issues encountered.

What do I think about the stability of the solution?

Cloudera 5 is currently very unstable. Between two Cloudera 5 clusters, we have an incident at least twice a week due to what are now outstanding bugs.

What do I think about the scalability of the solution?

It's very easy to deploy and scale as large as you want. Once created on the CM management cluster, is difficult to scale up as needed, as you add more clusters to the same CM instance.

Which solution did I use previously and why did I switch?

No previous solution was used.

How was the initial setup?

We were already running one production cluster with approximately 75 nodes when I joined, so I’m not familiar with what was needed to get the initial production cluster up. Once I joined, I assisted in standing up the additional nodes and clusters using our chef automation.

What about the implementation team?

In house via chef automation. Chef, or similar systems, makes it much simpler to stand up large scale clusters. That said, I have not used or evaluated vendor team implementation methods.

Disclosure: I am a real user, and this review is based on my own experience and opinions.

reviewer2392494

Enterprise Data Architect at a pharma/biotech company with 11-50 employees

Jul 5, 2024

Download

Used for big data analytics, data sharing, and reporting

Pros and Cons

"Cloudera, as a whole, is designed to provide organizations with solutions for big data."

"The performance of some analytics engines provided by Cloudera is not that good."

What is our primary use case?

We mostly use the solution for big data analytics, data sharing, and reporting.

What is most valuable?

Cloudera, as a whole, is designed to provide organizations with solutions for big data. Cloudera is not one single component. It has many components related to storage, analytics, queries, and processing. All of these components work together to support big data implementation and analytics.

What needs improvement?

The performance of some analytics engines provided by Cloudera is not that good. So, we are using other analytics tools besides Cloudera.

For how long have I used the solution?

I have been using the solution for more than four years.

Which solution did I use previously and why did I switch?

We also use other tools like DataIQ and Apache Kudu.

What other advice do I have?

I'm working with the solution myself. As a company, we are implementing it for other customers. Cloudera itself does not provide analytics. It prepares data for analytics tools that work with Big Data, such as Apache Spark, DataIQ, and Tableau.

Overall, I rate the solution a nine out of ten.

Disclosure: I am a real user, and this review is based on my own experience and opinions.

it_user1296588

Senior Software Engineer at a tech services company with 10,001+ employees

Mar 12, 2020

Download

Performs well and the technical support is helpful, but the upgrade process needs to be consolidated

Pros and Cons

"The most valuable feature is Impala, the querying engine, which is very fast."

"There is a maximum of a one-gigabyte block size, which is an area of storage that can be improved upon."

What is our primary use case?

We are dealing with data from the telecom industry. We were using an Oracle system but our volume has increased. We now have a lot of real-time data that needs to be transformed so that it can be made available and used.

What is most valuable?

The most valuable feature is Impala, the querying engine, which is very fast. We have been able to work with one terabyte of data in less than 20 minutes. The speed makes it easy for us to process all of the data that comes in, in time.

The support is very good.

All of the data has automatic triple replication in order to secure integrity.

What needs improvement?

There is a maximum of a one-gigabyte block size, which is an area of storage that can be improved upon.

When we are upgrading CDH, there are many things that need to be upgraded and it would be helpful if it were bundled. As it is now, we have to upgrade many different things separately.

For how long have I used the solution?

I have been working with the Cloudera Distribution for Hadoop for around two years.

What do I think about the stability of the solution?

It is a stable solution.

What do I think about the scalability of the solution?

The scalability is good and it works on commodity hardware. One of the problems we have right now is that there is a lot of data and we're moving it from our Oracle solution. This means that there is a double cost, in terms of storage, during our transition to working with big data.

We are using a data lake that is a store for all of the data in our organization. There are more than25 projects, with between 25 and 30 people in each one, for a total of almost 1,000 people. All of them are dependent on this solution.

Most of our users are technicians who have problems to solve using the data available to them. A couple of them are data scientists and the remainder are upper management, who do the analysis.

How are customer service and technical support?

The technical support is very good. Whenever we open a ticket, we get support right away.

Which solution did I use previously and why did I switch?

We did use another solution prior to this one but it could not keep up with our increase in data.

What other advice do I have?

This suitability of this solution depends on the size of the data that you are going to be working with. If you have going to be working with a huge dataset that contains many gigabytes of data then this is a good solution. For smaller datasets, you should also consider other technologies.

My advice for anybody who is implementing this solution is to take some time to learn it. Beyond that, be sure to contact support if you have any problems because they are very helpful.

I would rate this solution a seven out of ten.

Which deployment model are you using for this solution?

On-premises

Disclosure: I am a real user, and this review is based on my own experience and opinions.

reviewer1488372

Associate Manager at a consultancy with 501-1,000 employees

Mar 15, 2021

Download

Easy to install, good technical support, and with a single script we can run jobs within minutes

Pros and Cons

"I don't see any performance issues."

"It could be faster and more user-friendly."

What is our primary use case?

We use this solution to process data.

When using an SQL Server you have to build indexes and you need to fine-tune the data.

We import the data that is in the SQL Source.

With a single script, we are able to run the jobs within minutes, which is an advantage.

We are using the Power BI model for the business convention. The performance in Power BI will be reduced if you incorporate more calculations. Those calculations are captured in the Hadoop layer and processed.

What needs improvement?

It could be faster and more user-friendly.

For how long have I used the solution?

I have been using this solution for seven months.

What do I think about the stability of the solution?

It's a stable product. I don't see any performance issues.

What do I think about the scalability of the solution?

This solution is scalable. We have 40 users for different projects in our organization.

We will continue to use this solution.

How are customer service and technical support?

Technical support is good.

Which solution did I use previously and why did I switch?

I didn't use any other product.

How was the initial setup?

The installing is straightforward.

Our clients provide us with the access to use it directly.

Once you have been given access to the edge nodes we are able to run the scripts in the Hadoop layer.

What's my experience with pricing, setup cost, and licensing?

We do not pay for licensing because our customers forward it, so there is no need to purchase the license for the project.

What other advice do I have?

I would recommend this solution.

I would rate Cloudera Distribution for Hadoop a nine out of ten.

Which deployment model are you using for this solution?

Public Cloud

Disclosure: I am a real user, and this review is based on my own experience and opinions.