Try our new research platform with insights from 80,000+ expert users

Cloudera Data Science Workbench vs IBM SPSS Modeler comparison

Sponsored
 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

IBM SPSS Statistics
Sponsored
Ranking in Data Science Platforms
10th
Average Rating
8.0
Number of Reviews
37
Ranking in other categories
Data Mining (3rd)
Cloudera Data Science Workb...
Ranking in Data Science Platforms
21st
Average Rating
7.0
Number of Reviews
2
Ranking in other categories
No ranking in other categories
IBM SPSS Modeler
Ranking in Data Science Platforms
13th
Average Rating
8.0
Number of Reviews
39
Ranking in other categories
Data Mining (4th)
 

Mindshare comparison

As of November 2024, in the Data Science Platforms category, the mindshare of IBM SPSS Statistics is 2.8%, up from 2.6% compared to the previous year. The mindshare of Cloudera Data Science Workbench is 1.5%, down from 1.8% compared to the previous year. The mindshare of IBM SPSS Modeler is 2.5%, down from 2.7% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Science Platforms
 

Featured Reviews

AbakarAhmat - PeerSpot reviewer
Sep 21, 2023
Enhancing survey analysis that provides valued insightfulness
I use it to analyze questionnaire surveys related to a product, solution, or application, such as open data services, which I provide to consumers and end-users. These surveys contain evaluation assessments, and I use SPSS to analyze the responses The most valuable feature is its robust…
Ismail Peer - PeerSpot reviewer
Feb 13, 2024
Useful for data science modeling but improvement is needed in MLOps and pricing
We have different use cases. Our banking use case uses machine learning to identify customer life events and recommend the best-suited card products. These machine-learning models are deployed in our environment, where they run on a scheduled basis. We rely on the platform for every data science…
PeterHuo - PeerSpot reviewer
Jul 12, 2024
Good tool for extracting data from data warehouses, creating streams, and manipulating logic to extract final data
There are performance issues. Extracting data from many combined tables can take hours and occasionally crash the server due to memory leaks. This performance problem bothers people. The performance issue seems to be related to the server. We design streams on the client and submit them to the server, which generates a large SQL statement. There are two potential bottlenecks: one in the server and another in data extraction. I'm unsure about the exact mechanics of data splitting when fetching from the database. When streams become larger, performance bottlenecks may occur in the IBM SPSS Modeler server or the database. Sometimes the server crashes and needs to be restarted to release memory on both sides. I'm not sure exactly where the problem is caused, as I focus on stream design rather than server issues. The problem could be on the IBM SPSS Modeler server and database.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"It is a modeling tool with helpful automation."
"It offers very good visualization."
"The most valuable feature is the user interface because you don't need to write code."
"SPSS can handle whatever you throw at it, whether your data set contains 10,000, 100,000, or a million objects. It's like the heavy artillery of analytical tools."
"The solution has numerous valuable features. We particularly like custom tabs. It's very useful. We end up analyzing a lot of software data, so features related to custom tabs are really helpful."
"Most of the product features are good but I particularly like the linear regression analysis."
"It is perfectly adequate if all you need are the results and not the trail of evidence."
"In terms of the features I've found most valuable, I'd say the duration, the correlation, and of course the nonparametric statistics. I use it for reliability and survival analysis, time series, regression models in different solutions, and different types of solutions."
"I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don't interfere with each other. The deployment of machine learning is fast and easy to manage. Its API calls are also fast."
"The Cloudera Data Science Workbench is customizable and easy to use."
"Our go live process has been slightly enhanced compared to the previous programmatic process. There is now a faster time to production from the business end."
"It is very scalable for non-technical people."
"The supervised models are valuable. It is also very organized and easy to use."
"I think it is the point and drag features that are the most valuable. You can simply click at the windows, and then pull up the functions."
"Some basic form of feature engineering for classification models. This really quickens the model development process."
"Compared to other tools, the product works much easier to analyze data without coding."
"Very good data aggregation."
"It scales. I have not run into any challenges where it will not perform.​"
 

Cons

"Perhaps in terms of visualization. It's not really easy to do some data visualization, just simple, descriptive analysis in SPSS. I think that could be an area for improvement."
"I'd like to see them use more artificial intelligence. It should be smart enough to do predictions and everything based on what you input."
"This solution is not suitable for use with Big Data."
"The technical support should be improved."
"I think the visualization and charting should be changed and made easier and more effective."
"It could allow adding color to data models to make them easier to interpret."
"I know that SPSS is a statistical tool but it should also include a little bit of analytical behavior. You can call it augmented analysis or predictive analysis. The bottom line is it should have more graphical and analytical capabilities."
"Technical support needs some improvement, as they do not respond as quickly as we would like."
"Running this solution requires a minimum of 12GB to 16GB of RAM."
"The tool's MLOps is not good. It's pricing also needs to improve."
"The platform's cloud version needs improvements."
"Time Series or forecasting needs to be easier. It is a very important feature, and it should be made easier and more automated to use. For instance, for logistic regression, binary or multinomial is used automatically based on the type of the target variable. I wish they can make Time Series easier to use in a similar way."
"Expensive to deploy solutions. You need to buy an extra deployment unit."
"It would be beneficial if the tool would include more well-known machine learning algorithms."
"​Initial setup of the software was complex, because of our own problems within the government."
"Requires more development."
"The platform that you can deploy it on needs improvement because I think it is Windows only. I do not think it can run off a Red Hat, like the server products. I am pretty sure it is Windows and AIX only."
"The forecasting could be a bit easier."
 

Pricing and Cost Advice

"The pricing of the modeler is high and can reduce the utility of the product for those who can not afford to adopt it."
"SPSS is an expensive piece of software because it's incredibly complex and has been refined over decades, but I would say it's fairly priced."
"If it requires lot of data processing, maybe switching to IBM SPSS Clementine would be better for the buyer."
"More affordable training for new staff members."
"It's quite expensive, but they do a special deal for universities."
"The price of this solution is a little bit high, which was a problem for my company."
"We think that IBM SPSS is expensive for this function."
"I rate the tool's pricing a five out of ten."
"The product is expensive."
"When you are close to end of quarter, IBM and its partners can get you 60% to 70% discounts, so literally wait for the last day of the quarter for the best prices. You may feel like you are getting robbed if you can't receive a good discount."
"$5,000 annually."
"If you are in a university and the license is free then you can use the tool without any charges, which is good."
"This tool, being an IBM product, is pretty expensive."
"I am using the free version of IBM SPSS Modeler, it is the educational edition version."
"Its price is okay for a company, but for personal use, it is considered somewhat expensive."
"The government has funds and a budget, it's hard to say if it's expensive or cheap. In Canada, they have a yearly budget. They used to encourage people to use the modeler for development. If ten users use the server with ten licenses, it runs faster. But if forty users use the same appliance, everything slows down. People then think it's not easy to do things and prefer using remote tools like Python to extract data from the database. It's not about being expensive or cheap, but about people's knowledge and experience in how to do the work."
"It got us a good amount of money with quick and efficient modeling."
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
814,649 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
16%
University
10%
Computer Software Company
9%
Manufacturing Company
8%
Financial Services Firm
35%
Manufacturing Company
11%
Healthcare Company
9%
Government
7%
Educational Organization
14%
Financial Services Firm
12%
Computer Software Company
10%
University
9%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about IBM SPSS Statistics?
The software offers consistency across multiple research projects helping us with predictive analytics capabilities.
What is your experience regarding pricing and costs for IBM SPSS Statistics?
While the pricing of the product may be higher, the accompanying service and features justify the investment. However...
What needs improvement with IBM SPSS Statistics?
In some cases, the product takes time to load a large dataset. They could improve this particular area.
What do you like most about Cloudera Data Science Workbench?
I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don'...
What needs improvement with Cloudera Data Science Workbench?
The tool's MLOps is not good. It's pricing also needs to improve.
What is your primary use case for Cloudera Data Science Workbench?
We have different use cases. Our banking use case uses machine learning to identify customer life events and recommen...
What do you like most about IBM SPSS Modeler?
Compared to other tools, the product works much easier to analyze data without coding.
What is your experience regarding pricing and costs for IBM SPSS Modeler?
The government has funds and a budget, it's hard to say if it's expensive or cheap. In Canada, they have a yearly bud...
What needs improvement with IBM SPSS Modeler?
There are performance issues. Extracting data from many combined tables can take hours and occasionally crash the ser...
 

Also Known As

SPSS Statistics
CDSW
SPSS Modeler
 

Learn More

Video not available
Video not available
 

Overview

 

Sample Customers

LDB Group, RightShip, Tennessee Highway Patrol, Capgemini Consulting, TEAC Corporation, Ironside, nViso SA, Razorsight, Si.mobil, University Hospitals of Leicester, CROOZ Inc., GFS Fundraising Solutions, Nedbank Ltd., IDS-TILDA
IQVIA, Rush University Medical Center, Western Union
Reisebªro Idealtours GmbH, MedeAnalytics, Afni, Israel Electric Corporation, Nedbank Ltd., DigitalGlobe, Vodafone Hungary, Aegon Hungary, Bureau Veritas, Brammer Group, Florida Department of Juvenile Justice, InSites Consulting, Fortis Turkey
Find out what your peers are saying about Cloudera Data Science Workbench vs. IBM SPSS Modeler and other solutions. Updated: October 2024.
814,649 professionals have used our research since 2012.