Cloudera Distribution for Hadoop vs Pentaho Business Analytics comparison

 

Comparison Buyer's Guide

 

Categories and Ranking

Cloudera Distribution for H...
Average Rating: 8.0
Number of Reviews: 49
Ranking in other categories: Hadoop (2nd), NoSQL Databases (7th)

Pentaho Business Analytics
Average Rating: 8.0
Number of Reviews: 42
Ranking in other categories: BI (Business Intelligence) Tools (21st), Cloud Operations Analytics (4th), Reporting (15th)
 

Mindshare comparison

Cloudera Distribution for Hadoop and Pentaho Business Analytics aren’t in the same category and serve different purposes. Cloudera Distribution for Hadoop is designed for Hadoop and holds a mindshare of 27.1%, up 22.7% compared to last year.
Pentaho Business Analytics, on the other hand, focuses on BI (Business Intelligence) Tools and holds a 0.6% mindshare, down 0.6% since last year.
 

Featured Reviews

Shahan Rehman - PeerSpot reviewer
Mar 21, 2024
Can host multiple technologies and help businesses with their AI initiatives
The ease or difficulty of setting up the product depends on the customer environment where the tool is deployed. If a banking, industrial, or retail sector firm is taken into consideration, then depending on how big a database is maintained and which applications are to be hosted, the deployment process can range from simple to very complex, depending on the architecture. For Cloudera Distribution for Hadoop, one has to go through the usual deployment process, like for any software product. You have to have different environments before going into production, like pre-production, test, and dev environments. You install and configure all the components in the test environment and then test them in the pre-production environment. Once UAT is done, you move them to the production environment. In general, it's a critical product deployed in a company.
Sayan König - PeerSpot reviewer
Feb 16, 2022
Flexible, easy to understand, and simple to set up
It's a data warehouse for finance and central bank reporting purposes, and an ETL tool in this environment. It's easily understandable. The product is quick and flexible, with very good steps to make data transformation possible. The initial setup is pretty straightforward. The repository should…

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The product provides better data processing features than other tools."
"The solution is stable."
"We had a data warehouse before all the data. We can process a lot more data structures."
"With a cluster available, you can manage the security layer using the shared SDX - it provides flexibility."
"The product as a whole is good."
"The scalability of Cloudera Distribution for Hadoop is excellent."
"The most valuable feature is Kubernetes."
"The solution is reliable and stable, it fits our requirements."
"Easy to use components to create the job."
"The most valuable feature of Pentaho is the Tableau report."
"The initial setup is pretty straightforward."
"Pentaho Business Analytics' best features include the ease of developing data flows and the wide range of options to connect to databases, including those on the cloud."
"We were able to install it without any assistance from tech support."
"I use the BI Server, CDE Dashboards, Saiku, and Kettle, because these tools are very good and highly experienced."
"Pentaho is an analytics platform that can be used when an organization has a lot of big data storage systems already installed and needs to manage and analyze that data. It has a specific use case for unstructured data, such as documents, and needs to be able to search and analyze it."
 

Cons

"The dashboard could be improved."
"The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. The Cloudera Machine Learning aspect could be tuned and enhanced to enable us to host some predictive analytics machine learning and AI use cases."
"It would be useful if Cloudera had more tools like SQL Engines that offer the traditional relational database. We have to do a lot of work preparing the data outside Cloudera before getting it into the platform."
"The areas of improvement depend on the scale of the project. For banking customers, security features and an essential budget for commercial licenses would be the top priority. Data regulation could be the most crucial for a project with extensive data or an extra use case."
"It could be faster and more user-friendly."
"Cloudera's support is extremely bad and cannot be relied on."
"There are better solutions out there that have more features than this one."
"The tool's ability to be deployed on a cloud model is an area of concern where improvements are required."
"Version control would be a good addition."
"Another concern is that Pentaho is not customizable or interactive."
"Pentaho, at the general level, should greatly improve the easy construction of its dashboards and easy integration of information from different sources without technical user intervention."
"Logging capability is needed."
"Pentaho Business Analytics' user interface is outdated."
"Deployment is not simple. It is not simple because we are dealing with a lot of data; we are dealing with a lot of storage. So, it's not a simple process."
"The repository should be improved."
"We did not achieve the ROI. The work delivered to users had lesser value than the subscription cost."
 

Pricing and Cost Advice

"The tool is expensive...For the SMB market or customers whose environments are not that complex and do not have multiple systems running, Cloudera might not be a good option."
"The price could be better for the product."
"The product’s price varies from project to project."
"It is an expensive product."
"The price is very high. The solution is expensive."
"Cloudera Distribution for Hadoop is expensive, with support costs involved."
"I wouldn't recommend CDH to others because of its high cost."
"The solution is fairly expensive."
"Free and commercial versions are available."
"We were lucky enough to find a Pentaho OEM partner who offered a data warehouse model and the ETL software for about 60K SGD per year."
"Pentaho is expensive."
814,649 professionals have used our research since 2012.
 

Comparison Review

it_user6978 - PeerSpot reviewer
Jun 10, 2013
Jaspersoft vs. Pentaho – Which one to use & is there any need to purchase the commercial edition
Any company (be it technology, manufacturing, human resources, ecommerce, SME, etc.) always has a need for Business Intelligence to some extent or other. If cost is one of the factors under consideration, then the two BI tools at the forefront are Pentaho and Jaspersoft. But, often the same…
 

Top Industries

By visitors reading reviews
Financial Services Firm: 23%
Computer Software Company: 15%
Educational Organization: 10%
Manufacturing Company: 8%

Financial Services Firm: 25%
Computer Software Company: 14%
Educational Organization: 8%
Government: 8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Cloudera Distribution for Hadoop?
The tool can be deployed using different container technologies, which makes it very scalable.
What is your experience regarding pricing and costs for Cloudera Distribution for Hadoop?
The tool is expensive. Overall, it's not a cheap software tool, and that is why only large enterprises who are mature enough and have an architecture that is complex enough opt for Cloudera, as its...
What needs improvement with Cloudera Distribution for Hadoop?
The tool doesn't support reporting, and relational databases are still the major source of reporting data. Apache Iceberg will be launched soon within the Cloudera cluster for analytical purposes. ...
Seeking lightweight open source BI software
There are many...It would rather depend what System BI architecture or Enterprise legacy you have at your end...I would recommend as follows: 1) If you have legacies of SAP, Oracle - look for SAP...
What is your experience regarding pricing and costs for Pentaho Business Analytics?
The organization has both options based on their needs and budget constraints. The Enterprise Edition is expensive with references to an added number of features.
What needs improvement with Pentaho Business Analytics?
The product to me is not as user-friendly as other players in the market. It also still needs improvement in the reporting module. You will need to search for deployment examples or need to have a ...
 

Also Known As

No data available
Pentaho, Kettle, Hitachi Pentaho Business Analytics
 

Sample Customers

37signals, Adconion, adgooroo, Aggregate Knowledge, AMD, Apollo Group, Blackberry, Box, BT, CSC
Cargo 2000 Lufthansa, Marketo, ModCloth, Cardiac Science, Telefonica, ExactTarget, Active Broadband Networks, and Brussels Airport.
Find out what your peers are saying about Apache, Cloudera, Amazon Web Services (AWS) and others in Hadoop. Updated: October 2024.