Try our new research platform with insights from 80,000+ expert users

Cloudera Data Science Workbench vs H2O.ai comparison

Sponsored
 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

IBM SPSS Statistics
Sponsored
Ranking in Data Science Platforms
10th
Average Rating
8.0
Number of Reviews
37
Ranking in other categories
Data Mining (3rd)
Cloudera Data Science Workb...
Ranking in Data Science Platforms
21st
Average Rating
7.0
Number of Reviews
2
Ranking in other categories
No ranking in other categories
H2O.ai
Ranking in Data Science Platforms
22nd
Average Rating
7.6
Number of Reviews
7
Ranking in other categories
Model Monitoring (8th)
 

Mindshare comparison

As of November 2024, in the Data Science Platforms category, the mindshare of IBM SPSS Statistics is 2.8%, up from 2.6% compared to the previous year. The mindshare of Cloudera Data Science Workbench is 1.5%, down from 1.8% compared to the previous year. The mindshare of H2O.ai is 1.5%, up from 1.5% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Science Platforms
 

Featured Reviews

AbakarAhmat - PeerSpot reviewer
Sep 21, 2023
Enhancing survey analysis that provides valued insightfulness
I use it to analyze questionnaire surveys related to a product, solution, or application, such as open data services, which I provide to consumers and end-users. These surveys contain evaluation assessments, and I use SPSS to analyze the responses The most valuable feature is its robust…
Ismail Peer - PeerSpot reviewer
Feb 13, 2024
Useful for data science modeling but improvement is needed in MLOps and pricing
We have different use cases. Our banking use case uses machine learning to identify customer life events and recommend the best-suited card products. These machine-learning models are deployed in our environment, where they run on a scheduled basis. We rely on the platform for every data science…
RK
Dec 11, 2018
It is helpful, intuitive, and easy to use. The learning curve is not too steep.
One example, we are able to automate life insurance. We have to underwrite policies. When somebody applies for a policy, we take their blood, then assign them a risk: substandard, standard, preferred, etc. Depending on this, we price our products. Usually the process is that you take the blood, then it goes to a lab and we get the lab results back, then an underwriter takes a look at the lab results. This is usually done in a two week time frame to get a rating. We were able to build models to automate all of this, and now, it happens in real-time. Somebody can apply online and get issued a policy right away.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"In terms of the features I've found most valuable, I'd say the duration, the correlation, and of course the nonparametric statistics. I use it for reliability and survival analysis, time series, regression models in different solutions, and different types of solutions."
"in terms of the simplicity, I think the SPSS basic can handle it."
"The software offers consistency across multiple research projects helping us with predictive analytics capabilities."
"One feature I found very valuable was the analysis of variance (ANOVA)."
"Some of the most valuable features that we are using with some business models are machine learning algorithms, statistical models given to us by the business, and getting data from the database or text files."
"You can quickly build models because it does the work for you."
"The most valuable features are the small learning curve and its ability to hold a lot of data."
"The most valuable feature of IBM SPSS Statistics is all the functionality it provides. Additionally, it is simple to do the five-way analysis that you can into multidimensional setup space. It's the multidimensional space facility that is most useful."
"The Cloudera Data Science Workbench is customizable and easy to use."
"I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don't interfere with each other. The deployment of machine learning is fast and easy to manage. Its API calls are also fast."
"One of the most interesting features of the product is their driverless component. The driverless component allows you to test several different algorithms along with navigating you through choosing the best algorithm."
"The most valuable features are the machine learning tools, the support for Jupyter Notebooks, and the collaboration that allows you to share it across people."
"Fast training, memory-efficient DataFrame manipulation, well-documented, easy-to-use algorithms, ability to integrate with enterprise Java apps (through POJO/MOJO) are the main reasons why we switched from Spark to H2O."
"AutoML helps in hands-free initial evaluations of efficiency/accuracy of ML algorithms."
"It is helpful, intuitive, and easy to use. The learning curve is not too steep."
"The ease of use in connecting to our cluster machines."
 

Cons

"I would like SPSS to improve its integration with other data-filing IBM tools. I also think its duration with data, utilization, and graphics could be better."
"Better documentation on how to use macros."
"It could provide even more in the way of automation as there are many opportunities."
"One of the areas that should be similar to Minitabs is the use of blogs. The Minitabs blog helps users understand the tools and gives lots of practical examples. Following the SPSS manual is cumbersome. It's a good, exhaustive manual, but it's not practical to use. With Minitabs, you can go to the blogs and find specific articles written about various components and it's very helpful. Without blogs, we find SPSS more complicated."
"IBM SPSS Statistics could improve the visual outputs where you are producing, for example, a graph for a company board of directors, or an advert."
"Most of the package will give you the fixed value, or the p-value, without an explanation as to whether it it significant or not. Some beginners might need not just the results, but also some explanation for them."
"There is a learning curve; it's not very steep, but there is one."
"Each algorithm could be more adaptable to some industry-specific areas, or, in some cases, adapted for maintenance."
"The tool's MLOps is not good. It's pricing also needs to improve."
"Running this solution requires a minimum of 12GB to 16GB of RAM."
"The model management features could be improved."
"On the topic of model training and model governance, this solution cannot handle ten or twelve models running at the same time."
"The interpretability module has room for improvement. Also, it needs to improve its ability to integrate with other systems, like SageMaker, and the overall integration capability."
"It lacks the data manipulation capabilities of R and Pandas DataFrames. We would kill for dplyr offloading H2O."
"I would like to see more features related to deployment."
"It needs a drag and drop GUI like KNIME, for easy access to and visibility of workflows."
"Referring to bullet-3 as well, H2O DataFrame manipulation capabilities are too primitive."
 

Pricing and Cost Advice

"The price of IBM SPSS Statistics could improve."
"I rate the tool's pricing a five out of ten."
"The pricing of the modeler is high and can reduce the utility of the product for those who can not afford to adopt it."
"It's quite expensive, but they do a special deal for universities."
"If it requires lot of data processing, maybe switching to IBM SPSS Clementine would be better for the buyer."
"While the pricing of the product may be higher, the accompanying service and features justify the investment."
"We think that IBM SPSS is expensive for this function."
"Our licence is on a yearly renewal basis. While pricing is not the primary concern in our evaluation, as products are assessed by whether they can meet our user needs and expertise, the cost can be a limiting factor in the number of licences we procure."
"The product is expensive."
"We have seen significant ROI where we were able to use the product in certain key projects and could automate a lot of processes. We were even able to reduce staff."
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
814,649 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
16%
University
10%
Computer Software Company
9%
Manufacturing Company
8%
Financial Services Firm
35%
Manufacturing Company
11%
Healthcare Company
9%
Government
7%
Financial Services Firm
19%
Computer Software Company
11%
Manufacturing Company
9%
Insurance Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about IBM SPSS Statistics?
The software offers consistency across multiple research projects helping us with predictive analytics capabilities.
What is your experience regarding pricing and costs for IBM SPSS Statistics?
While the pricing of the product may be higher, the accompanying service and features justify the investment. However...
What needs improvement with IBM SPSS Statistics?
In some cases, the product takes time to load a large dataset. They could improve this particular area.
What do you like most about Cloudera Data Science Workbench?
I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don'...
What needs improvement with Cloudera Data Science Workbench?
The tool's MLOps is not good. It's pricing also needs to improve.
What is your primary use case for Cloudera Data Science Workbench?
We have different use cases. Our banking use case uses machine learning to identify customer life events and recommen...
Ask a question
Earn 20 points
 

Also Known As

SPSS Statistics
CDSW
No data available
 

Learn More

Video not available
 

Overview

 

Sample Customers

LDB Group, RightShip, Tennessee Highway Patrol, Capgemini Consulting, TEAC Corporation, Ironside, nViso SA, Razorsight, Si.mobil, University Hospitals of Leicester, CROOZ Inc., GFS Fundraising Solutions, Nedbank Ltd., IDS-TILDA
IQVIA, Rush University Medical Center, Western Union
poder.io, Stanley Black & Decker, G5, PWC, Comcast, Cisco
Find out what your peers are saying about Cloudera Data Science Workbench vs. H2O.ai and other solutions. Updated: October 2024.
814,649 professionals have used our research since 2012.