
Databricks vs Pentaho Business Analytics comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

IBM SPSS Statistics
Average Rating: 8.0
Reviews Sentiment: 6.9
Number of Reviews: 37
Ranking in other categories: Data Mining (3rd), Data Science Platforms (9th)

Databricks
Average Rating: 8.2
Reviews Sentiment: 7.0
Number of Reviews: 84
Ranking in other categories: Data Science Platforms (1st), Streaming Analytics (1st)

Pentaho Business Analytics
Average Rating: 8.0
Reviews Sentiment: 5.8
Number of Reviews: 43
Ranking in other categories: BI (Business Intelligence) Tools (20th), Cloud Operations Analytics (4th), Reporting (16th)
 

Mindshare comparison

Data Science Platforms
BI (Business Intelligence) Tools
 

Featured Reviews

Md Masudul Hassan - PeerSpot reviewer
Comprehensive data analysis capabilities with a user-friendly interface, providing an efficient and reliable platform for researchers and analysts
I believe that offering short-term SPSS licenses, perhaps when customer sourcing is available, could make it more affordable. These licenses shouldn't include features tailored for universities or large sales organizations. Instead, they could offer discounts or additional facilities for smaller entities to access the software. In developing countries, it would be beneficial to provide certain features to users at no cost initially, while also customizing pricing options. For example, offering basic features to the first hundred users can help them become familiar with the software and its capabilities. This approach encourages users to upgrade to higher tiers as they become more experienced and require additional functionality.
Dunstan Matekenya - PeerSpot reviewer
Process large-scale data sets and integrates with Apache Spark with notebook environment
Databricks integrates natively with Apache Spark, which I use as a processing engine for large-scale datasets. This native integration is one of its strengths. Another strength is that the platform makes it very easy to manage resources. For example, setting up a cluster of five or fifteen nodes is straightforward with Databricks. The notebook environment is also excellent, making it easy to perform various tasks.
Sayan König - PeerSpot reviewer
Flexible, easy to understand, and simple to set up
The repository should be improved. There should be the possibility to have versioning, to make it combinable with some Git repositories or something like that, to check out the processes and make sure it has a traceable history. The solution could really be improved. There are too many bugs in our version.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable features are the small learning curve and its ability to hold a lot of data."
"The most valuable feature is the user interface because you don't need to write code."
"The features that I have found most valuable are the Bayesian statistics and descriptive statistics."
"I've found the descriptive statistics and cross-tabs valuable. The very simple correlations and regressions are as well."
"It offers very good visualization."
"The most valuable features mainly include factor analysis, correlation analysis, and geographic analysis."
"It is perfectly adequate if all you need are the results and not the trail of evidence."
"SPSS is quite robust and quicker in terms of providing you the output."
"Easy to use and requires minimal coding and customizations."
"I haven't heard about any major stability issues. At this time I feel like it's stable."
"One of the features provides nice interactive clusters, or compute instances that you don't really need to manage often."
"The solution is built from Spark and has integration with MLflow, which is important for our use case."
"When we have a huge volume of data that we want to process with speed, velocity, and volume, we go through Databricks."
"It is fast, it's scalable, and it does the job it needs to do."
"Databricks integrates well with other solutions."
"This solution offers a lake house data concept that we have found exciting. We are able to have a large amount of data in a data lake and can manage all relational activities."
"We were able to install it without any assistance from tech support."
"It is robust, offers market intelligence, and utilizes modules effectively."
"I use the BI Server, CDE Dashboards, Saiku, and Kettle, because these tools are very good and highly experienced."
"Pentaho Business Analytics' best features include the ease of developing data flows and the wide range of options to connect to databases, including those on the cloud."
"Easy to use components to create the job."
"The initial setup is pretty straightforward."
"The most valuable feature of Pentaho is the Tableau report."
"Pentaho is an analytics platform that can be used when an organization has a lot of big data storage systems already installed and needs to manage and analyze that data. It has a specific use case for unstructured data, such as documents, and needs to be able to search and analyze it."
 

Cons

"The solution needs to improve forecasting using time series analysis."
"It would be helpful if there was better documentation on how to properly use the solution. A beginner's guide on how to use the various programming functions within the product would be so useful to a lot of people. I found that everything was very confusing at first. Having clear documentation would help alleviate that."
"Better documentation on how to use macros."
"The technical support should be improved."
"One of the areas that should be similar to Minitabs is the use of blogs. The Minitabs blog helps users understand the tools and gives lots of practical examples. Following the SPSS manual is cumbersome. It's a good, exhaustive manual, but it's not practical to use. With Minitabs, you can go to the blogs and find specific articles written about various components and it's very helpful. Without blogs, we find SPSS more complicated."
"I would like SPSS to improve its integration with other data-filing IBM tools. I also think its duration with data, utilization, and graphics could be better."
"The product should provide more ways to import data and export results that are user-friendly for high-level executives."
"If there is any self-generation data collection plan (DCP), it would be helpful in gathering data. It would also be useful if there is a function to scale it up to, let's say, UiPath and have it consolidate and integrate into a UiPath solution."
"The integration of data could be a bit better."
"Costs can quickly add up if you don't plan for it."
"Some of the error messages that we receive are too vague, saying things like 'unknown exception', and these should be improved to make it easier for developers to debug problems."
"I would like it if Databricks made it easier to set up a project."
"Overall it's a good product, however, it doesn't do well against any individual best-of-breed products."
"Implementation of Databricks is still very code heavy."
"Databricks requires writing code in Python or SQL, so if you're a good programmer then you can use Databricks."
"The integration and query capabilities can be improved."
"Pentaho Business Analytics' user interface is outdated."
"Another concern is that Pentaho is not customizable or interactive."
"Deployment is not simple. It is not simple because we are dealing with a lot of data; we are dealing with a lot of storage. So, it's not a simple process."
"Version control would be a good addition."
"The tool is very good, and yet it has some problems as it relies heavily on Java."
"We did not achieve the ROI. The work delivered to users had lesser value than the subscription cost."
"The repository should be improved."
"Logging capability is needed."
 

Pricing and Cost Advice

"More affordable training for new staff members."
"It's quite expensive, but they do a special deal for universities."
"Our licence is on a yearly renewal basis. While pricing is not the primary concern in our evaluation, as products are assessed by whether they can meet our user needs and expertise, the cost can be a limiting factor in the number of licences we procure."
"We think that IBM SPSS is expensive for this function."
"The pricing of the modeler is high and can reduce the utility of the product for those who cannot afford to adopt it."
"I rate the tool's pricing a five out of ten."
"While the pricing of the product may be higher, the accompanying service and features justify the investment."
"The price of IBM SPSS Statistics could improve."
"The licensing costs of Databricks depend on how many licenses we need, depending on which Databricks provides a lot of discounts."
"The cost for Databricks depends on the use case. I work on it as a consultant, so I'm using the client's Databricks, so it depends on how big the client is."
"The solution requires a subscription."
"I would rate Databricks' pricing seven out of ten."
"The product pricing is moderate."
"Price-wise, I would rate Databricks a three out of five."
"The price of Databricks is reasonable compared to other solutions."
"We only pay for the Azure compute behind the solution."
"Free and commercial versions are available."
"We were lucky enough to find a Pentaho OEM partner who offered a data warehouse model and the ETL software for about 60K SGD per year."
"Pentaho is expensive."
824,053 professionals have used our research since 2012.
 

Comparison Review

it_user6978 - PeerSpot reviewer
Jun 10, 2013
Jaspersoft vs. Pentaho – Which one to use & is there any need to purchase the commercial edition
Any company (be it technology, manufacturing, human resources, e-commerce, SME, etc.) always needs Business Intelligence to some extent. If cost is one of the factors under consideration, then the two BI tools at the forefront are Pentaho and Jaspersoft. But, often the same…
 

Top Industries

By visitors reading reviews
IBM SPSS Statistics
Financial Services Firm: 17%
Computer Software Company: 9%
University: 8%
Manufacturing Company: 8%

Databricks
Financial Services Firm: 16%
Computer Software Company: 11%
Manufacturing Company: 9%
Healthcare Company: 6%

Pentaho Business Analytics
Financial Services Firm: 25%
Computer Software Company: 13%
Government: 8%
Educational Organization: 6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about IBM SPSS Statistics?
The software offers consistency across multiple research projects helping us with predictive analytics capabilities.
What is your experience regarding pricing and costs for IBM SPSS Statistics?
The cost of IBM SPSS Statistics is managed by organizations, not individual researchers. It is a very expensive produ...
What needs improvement with IBM SPSS Statistics?
IBM SPSS Statistics does not keep you close to your data like KNIME. In KNIME, at every stage, you can see the result...
Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designe...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analyti...
Seeking lightweight open source BI software
There are many...It would rather depend what System BI architecture or Enterprise legacy you have at your end...I wou...
What is your experience regarding pricing and costs for Pentaho Business Analytics?
The organization has both options based on their needs and budget constraints. The Enterprise Edition is expensive wi...
What needs improvement with Pentaho Business Analytics?
The product to me is not as user-friendly as other players in the market. It also still needs improvement in the repo...
 

Also Known As

IBM SPSS Statistics: SPSS Statistics
Databricks: Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
Pentaho Business Analytics: Pentaho, Kettle, Hitachi Pentaho Business Analytics
 

Overview

 

Sample Customers

IBM SPSS Statistics: LDB Group, RightShip, Tennessee Highway Patrol, Capgemini Consulting, TEAC Corporation, Ironside, nViso SA, Razorsight, Si.mobil, University Hospitals of Leicester, CROOZ Inc., GFS Fundraising Solutions, Nedbank Ltd., IDS-TILDA
Databricks: Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Pentaho Business Analytics: Cargo 2000 Lufthansa, Marketo, ModCloth, Cardiac Science, Telefonica, ExactTarget, Active Broadband Networks, and Brussels Airport.
Find out what your peers are saying about Databricks, Knime, Amazon Web Services (AWS) and others in Data Science Platforms. Updated: December 2024.