Try our new research platform with insights from 80,000+ expert users

Databricks vs Pentaho Business Analytics comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Databricks
Average Rating
8.2
Reviews Sentiment
7.0
Number of Reviews
88
Ranking in other categories
Cloud Data Warehouse (7th), Data Science Platforms (1st), Streaming Analytics (1st)
Pentaho Business Analytics
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
43
Ranking in other categories
BI (Business Intelligence) Tools (21st), Cloud Operations Analytics (4th), Reporting (16th)
 

Mindshare comparison

While both are Business Intelligence solutions, they serve different purposes. Databricks is designed for Cloud Data Warehouse and holds a mindshare of 7.8%, up 2.9% compared to last year.
Pentaho Business Analytics, on the other hand, focuses on BI (Business Intelligence) Tools, holds 0.4% mindshare, down 0.6% since last year.
Cloud Data Warehouse
BI (Business Intelligence) Tools
 

Featured Reviews

ShubhamSharma7 - PeerSpot reviewer
Capability to integrate diverse coding languages in a single notebook greatly enhances workflow
Databricks offers various courses that I can use, whether it's PySpark, Scala, or R. I can leverage all these courses in a single notebook, which is beneficial for clients as they can access various tools in one place whenever needed. This is quite significant. I usually work with PySpark based on client requirements. After coding, I feed the Databricks notebooks into the ADF pipeline for updates. Databricks' capability to process data in parallel enhances data processing speed. Furthermore, I can connect our Databricks notebook directly with Power BI and other visualization tools like Qlik. Once we develop code, it allows us to transform raw data into visualizations for clients using analysis diagrams, which is very helpful.
Sayan König - PeerSpot reviewer
Flexible, easy to understand, and simple to set up
The repository should be improved. There should be the possibility to have versioning, to make it combinable with some Git repositories or something like that, to check out the processes and make sure it has a traceable history. The solution could really be improved. There are too many bugs in our version.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The Delta Lake data type has been the most useful part of this solution. Delta Lake is an opensource data type and it was implemented and invented by Databricks."
"One of the features provides nice interactive clusters, or compute instances that you don't really need to manage often."
"Can cut across the entire ecosystem of open source technology to give an extra level of getting the transformatory process of the data."
"Specifically for data science and data analytics purposes, it can handle large amounts of data in less time. I can compare it with Teradata. If a job takes five hours with Teradata databases, Databricks can complete it in around three to three and a half hours."
"Databricks is a unified solution that we can use for streaming. It is supporting open source languages, which are cloud-agnostic. When I do database coding if any other tool has a similar language pack to Excel or SQL, I can use the same knowledge, limiting the need to learn new things. It supports a lot of Python libraries where I can use some very easily."
"I would rate them ten out of ten."
"The most valuable feature of Databricks is the integration of the data warehouse and data lake, and the development of the lake house. Additionally, it integrates well with Spark for processing data in production."
"The fast data loading process and data storage capabilities are great."
"The initial setup is pretty straightforward."
"We were able to install it without any assistance from tech support."
"Pentaho Business Analytics' best features include the ease of developing data flows and the wide range of options to connect to databases, including those on the cloud."
"The most valuable feature of Pentaho is the Tableau report."
"It is robust, offers market intelligence, and utilizes modules effectively."
"Easy to use components to create the job."
"Pentaho is an analytics platform that can be used when an organization has a lot of big data storage systems already installed and needs to manage and analyze that data. It has a specific use case for unstructured data, such as documents, and needs to be able to search and analyze it."
"I use the BI Server, CDE Dashboards, Saiku, and Kettle, because these tools are very good and highly experienced."
 

Cons

"I have seen better user interfaces, so that is something that can be improved."
"The data visualization for this solution could be improved. They have started to roll out a data visualization tool inside Databricks but it is in the early stages. It's not comparable to a solution like Power BI, Luca, or Tableau."
"There would also be benefits if more options were available for workers, or the clusters of the two points."
"The product cannot be integrated with a popular coding IDE."
"The solution could be improved by integrating it with data packets. Right now, the load tables provide a function, like team collaboration. Still, it's unclear as to if there's a function to create different branches and/or more branches. Our team had used data packets before, however, I feel it's difficult to integrate the current with the previous data packets."
"Doesn't provide a lot of credits or trial options."
"The biggest problem associated with the product is that it is quite pricey."
"I would like to see more documentation in terms of how an end-user could use it, and users like me can easily try it and implement use cases."
"Version control would be a good addition."
"Deployment is not simple. It is not simple because we are dealing with a lot of data; we are dealing with a lot of storage. So, it's not a simple process."
"The repository should be improved."
"Pentaho Business Analytics' user interface is outdated."
"Logging capability is needed."
"We did not achieve the ROI. The work delivered to users had lesser value than the subscription cost."
"The tool is very good, and yet it has some problems as it relies heavily on Java."
"Another concern is that Pentaho is not customizable or interactive."
 

Pricing and Cost Advice

"The solution uses a pay-per-use model with an annual subscription fee or package. Typically this solution is used on a cloud platform, such as Azure or AWS, but more people are choosing Azure because the price is more reasonable."
"I do not exactly know the costs, but one of our clients pays between $100 USD and $200 USD monthly."
"The licensing costs of Databricks is a tiered licensing regime, so it is flexible."
"Databricks is a very expensive solution. Pricing is an area that could definitely be improved. They could provide a lower end compute and probably reduce the price."
"I would rate the tool’s pricing an eight out of ten."
"The basic version of this solution is now open-source, so there are no license costs involved. However, there is a charge for any advanced functionality and this can be quite expensive."
"The product pricing is moderate."
"The solution is based on a licensing model."
"Pentaho is expensive ."
"Free and commercial versions are available."
"We were lucky enough to find a Pentaho OEM partner who offered a data warehouse model and the ETL software for about 60K SGD per year."
report
Use our free recommendation engine to learn which Cloud Data Warehouse solutions are best for your needs.
841,004 professionals have used our research since 2012.
 

Comparison Review

it_user6978 - PeerSpot reviewer
Jun 10, 2013
Jaspersoft vs. Pentaho – Which one to use & is there any need to purchase the commercial edition
Any company (be it technology, manfucaturing, human resource, ecommerce, SME etc) always has the need for Business Intelligence to some or the other extent. If cost is one of the consideration factor, then the 2 BI tools which are at the forefront are Pentaho and Jaspersoft. But, often the same…
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
Computer Software Company
11%
Manufacturing Company
9%
Healthcare Company
6%
Financial Services Firm
27%
Computer Software Company
12%
Government
7%
Educational Organization
7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or Python. It offers many different cluster choices and excellent integration with ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designed to accelerate innovation projects. It is based on Spark so it is very fast. It...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics teams that have to interpret data to further the business goals of their orga...
Seeking lightweight open source BI software
There are many...It would rather depend what System BI architecture or Enterprise legacy you have at your end...I would recommend as follows: 1) If you have legacies of SAP, Oracle - look for SAP...
What is your experience regarding pricing and costs for Pentaho Business Analytics?
For those starting to use this tool, there is a free version available which is beneficial. The company also finds the pricing to be good.
What needs improvement with Pentaho Business Analytics?
The tool is very good, and yet it has some problems as it relies heavily on Java. The platform works with Java, and it has brought some issues to the company.
 

Also Known As

Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
Pentaho, Kettle, Hitachi Pentaho Business Analytics
 

Overview

 

Sample Customers

Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Cargo 2000 Lufthansa, Marketo, ModCloth, Cardiac Science, Telefonica, ExactTarget, Active Broadband Networks, and Brussels Airport.
Find out what your peers are saying about Snowflake Computing, Microsoft, Google and others in Cloud Data Warehouse. Updated: February 2025.
841,004 professionals have used our research since 2012.