Try our new research platform with insights from 80,000+ expert users

Cloudera Data Science Workbench vs RapidMiner comparison

Sponsored
 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

IBM SPSS Statistics
Sponsored
Ranking in Data Science Platforms
10th
Average Rating
8.0
Number of Reviews
37
Ranking in other categories
Data Mining (3rd)
Cloudera Data Science Workb...
Ranking in Data Science Platforms
21st
Average Rating
7.0
Number of Reviews
2
Ranking in other categories
No ranking in other categories
RapidMiner
Ranking in Data Science Platforms
6th
Average Rating
8.6
Number of Reviews
22
Ranking in other categories
Predictive Analytics (3rd)
 

Mindshare comparison

As of November 2024, in the Data Science Platforms category, the mindshare of IBM SPSS Statistics is 2.8%, up from 2.6% compared to the previous year. The mindshare of Cloudera Data Science Workbench is 1.5%, down from 1.8% compared to the previous year. The mindshare of RapidMiner is 7.6%, up from 5.1% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Science Platforms
 

Featured Reviews

AbakarAhmat - PeerSpot reviewer
Enhancing survey analysis that provides valued insightfulness
I used traditional tools where I would prepare data, click through menus, and use SQL Server for data visualization. We switched to IBM SPSS because it offers strong certification and aligns well with the standards we prioritize in our surveys. In terms of popularity, it stands out as the top choice in the market, especially in the research and university domains. Many different organizations and institutions use SPSS for statistical analytics. While there are other tools like MCLab and similar options available, SPSS is the most renowned and widely used among them.
Ismail Peer - PeerSpot reviewer
Useful for data science modeling but improvement is needed in MLOps and pricing
If you don't configure CDSW well, then it might be not useful for you. Deploying the tool can vary in complexity, but most of the time, it's relatively simple and straightforward. Triggering a job from data to production is easy, as the platform automates the deployment process. However, ensuring optimal resource allocation is essential for smooth operations.
Rathnam Makam - PeerSpot reviewer
A no-code tool that helps to build machine learning models
One challenge I encountered while implementing RapidMiner was the lack of documentation. Since there aren't as many users, finding resources to learn the tool was initially difficult. To overcome this hurdle, I believe RapidMiner could improve by providing more tutorials tailored for new users. I haven't explored the tool's latest version, so I'm unaware of the current features. However, I think it would be beneficial if they could enhance capabilities related to deep neural networks, provide better support for generating UI, and allow for importing and utilizing large language models.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The SPSS interface is very accessible and user-friendly. It's really easy to get information in it. I've shared it with experts and beginners, and everyone can navigate it."
"It is a modeling tool with helpful automation."
"IBM SPSS Statistics depends on AI."
"It has the ability to easily change any variable in our research."
"Custom tables and macros: They allow us to create useful reports quickly for a broad audience."
"The best part is that they have an algorithm handbook, so you can open it up and understand how it works, and if it is useful, this is very important."
"The learning curve to using this product is not steep. The program is appropriate for those who do not have a lot of background in programming, yet have to perform basic statistical analysis."
"SPSS is quite robust and quicker in terms of providing you the output."
"The Cloudera Data Science Workbench is customizable and easy to use."
"I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don't interfere with each other. The deployment of machine learning is fast and easy to manage. Its API calls are also fast."
"The documentation for this solution is very good, where each operator is explained with how to use it."
"The most valuable features are the Binary classification and Auto Model."
"The most valuable feature of RapidMiner is that it is code free. It is similar to playing with Lego pieces and executing after you are finished to see the results. Additionally, it is easy to use and has interesting utilities when preparing the data. It has a utility to automatically launch a series of models and show the comparisons. When finished with the comparisons you can select the best one, and deploy it automatically."
"We value the collaboration and governance features because it's a comprehensive platform that covers everything from data extraction to modeling operations in the ML language. RapidMiner is competitive in the ML space."
"The most valuable feature of RapidMiner is that it can read a large number of file formats including CSV, Excel, and in particular, SPSS."
"It is easy to use and has a huge community that I can rely on for help. Moreover, it is interactive."
"RapidMiner for Windows is an excellent graphical tool for data science."
"The solution is stable."
 

Cons

"The design of the experience can be improved."
"SPSS is a tool that's been around since the late 60s, and it's the universal worldwide standard for quantitative social science data analysis. That said, it does seem a bit strange to me that the graphical output functions are so clunky after all these years. The output of charts and graphs that SPSS produces is hideous."
"There is a learning curve; it's not very steep, but there is one."
"The solution needs to improve forecasting using time series analysis."
"It could allow adding color to data models to make them easier to interpret."
"Each algorithm could be more adaptable to some industry-specific areas, or, in some cases, adapted for maintenance."
"The technical support should be improved."
"I'd like to see them use more artificial intelligence. It should be smart enough to do predictions and everything based on what you input."
"Running this solution requires a minimum of 12GB to 16GB of RAM."
"The tool's MLOps is not good. It's pricing also needs to improve."
"RapidMiner would be improved with the inclusion of more machine learning algorithms for generating time-series forecasting models."
"If they could include video tutorials, people would find that quite helpful."
"One challenge I encountered while implementing RapidMiner was the lack of documentation. Since there aren't as many users, finding resources to learn the tool was initially difficult. To overcome this hurdle, I believe RapidMiner could improve by providing more tutorials tailored for new users."
"Improve the online data services."
"A great product but confusing in some way with regard to the user interface and integration with other tools."
"The price of this solution should be improved."
"The product must provide data-cleaning features."
"I would appreciate improvements in automation and customization options to further streamline processes."
 

Pricing and Cost Advice

"The pricing of the modeler is high and can reduce the utility of the product for those who can not afford to adopt it."
"More affordable training for new staff members."
"Our licence is on a yearly renewal basis. While pricing is not the primary concern in our evaluation, as products are assessed by whether they can meet our user needs and expertise, the cost can be a limiting factor in the number of licences we procure."
"The price of IBM SPSS Statistics could improve."
"If it requires lot of data processing, maybe switching to IBM SPSS Clementine would be better for the buyer."
"The price of this solution is a little bit high, which was a problem for my company."
"While the pricing of the product may be higher, the accompanying service and features justify the investment."
"It's quite expensive, but they do a special deal for universities."
"The product is expensive."
"The client only has to pay the licensing costs. There are not any maintenance or hidden costs in addition to the license."
"Although we don't pay licensing fees because it is being used within the university, my understanding is that the cost is between $5,000 and $10,000 USD per year."
"I'm not fully aware of RapidMiner's price because we had licenses provided, but from my analysis, it's moderately priced, not too high or too low. It's worth the investment."
"For the university, the cost of the solution is free for the students and teachers."
"I used an educational license for this solution, which is available free of charge."
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
816,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
University
9%
Computer Software Company
9%
Manufacturing Company
8%
Financial Services Firm
35%
Manufacturing Company
11%
Healthcare Company
9%
Government
7%
University
11%
Computer Software Company
11%
Educational Organization
10%
Financial Services Firm
10%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about IBM SPSS Statistics?
The software offers consistency across multiple research projects helping us with predictive analytics capabilities.
What is your experience regarding pricing and costs for IBM SPSS Statistics?
The cost of IBM SPSS Statistics is managed by organizations, not individual researchers. It is a very expensive produ...
What needs improvement with IBM SPSS Statistics?
IBM SPSS Statistics does not keep you close to your data like KNIME. In KNIME, at every stage, you can see the result...
What do you like most about Cloudera Data Science Workbench?
I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don'...
What needs improvement with Cloudera Data Science Workbench?
The tool's MLOps is not good. It's pricing also needs to improve.
What is your primary use case for Cloudera Data Science Workbench?
We have different use cases. Our banking use case uses machine learning to identify customer life events and recommen...
What do you like most about RapidMiner?
RapidMiner is a no-code machine learning tool. I can install it on my local machine and work with smaller datasets. I...
What is your experience regarding pricing and costs for RapidMiner?
I'm not fully aware of RapidMiner's price because we had licenses provided, but from my analysis, it's moderately pri...
What needs improvement with RapidMiner?
The product must provide data-cleaning features. I could not use RapidMiner for data cleaning in one of my projects a...
 

Also Known As

SPSS Statistics
CDSW
No data available
 

Learn More

Video not available
 

Overview

 

Sample Customers

LDB Group, RightShip, Tennessee Highway Patrol, Capgemini Consulting, TEAC Corporation, Ironside, nViso SA, Razorsight, Si.mobil, University Hospitals of Leicester, CROOZ Inc., GFS Fundraising Solutions, Nedbank Ltd., IDS-TILDA
IQVIA, Rush University Medical Center, Western Union
PayPal, Deloitte, eBay, Cisco, Miele, Volkswagen
Find out what your peers are saying about Cloudera Data Science Workbench vs. RapidMiner and other solutions. Updated: October 2024.
816,406 professionals have used our research since 2012.