Try our new research platform with insights from 80,000+ expert users

Cloudera Data Science Workbench vs RapidMiner comparison

Sponsored
 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 5, 2024
 

Categories and Ranking

IBM SPSS Statistics
Sponsored
Ranking in Data Science Platforms
9th
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
37
Ranking in other categories
Data Mining (3rd)
Cloudera Data Science Workb...
Ranking in Data Science Platforms
22nd
Average Rating
7.0
Reviews Sentiment
6.9
Number of Reviews
2
Ranking in other categories
No ranking in other categories
RapidMiner
Ranking in Data Science Platforms
6th
Average Rating
8.6
Reviews Sentiment
7.0
Number of Reviews
22
Ranking in other categories
Predictive Analytics (3rd)
 

Mindshare comparison

As of December 2024, in the Data Science Platforms category, the mindshare of IBM SPSS Statistics is 2.7%, up from 2.7% compared to the previous year. The mindshare of Cloudera Data Science Workbench is 1.5%, down from 1.8% compared to the previous year. The mindshare of RapidMiner is 7.7%, up from 5.5% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Science Platforms
 

Featured Reviews

Md Masudul Hassan - PeerSpot reviewer
Comprehensive data analysis capabilities with a user-friendly interface, providing an efficient and reliable platform for researchers and analysts
I believe that offering short-term SPSS licenses, perhaps when customer sourcing is available, could make it more affordable. These licenses shouldn't include features tailored for universities or large sales organizations. Instead, they could offer discounts or additional facilities for smaller entities to access the software. In developing countries, it would be beneficial to provide certain features to users at no cost initially, while also customizing pricing options. For example, offering basic features to the first hundred users can help them become familiar with the software and its capabilities. This approach encourages users to upgrade to higher tiers as they become more experienced and require additional functionality.
Ismail Peer - PeerSpot reviewer
Useful for data science modeling but improvement is needed in MLOps and pricing
If you don't configure CDSW well, then it might be not useful for you. Deploying the tool can vary in complexity, but most of the time, it's relatively simple and straightforward. Triggering a job from data to production is easy, as the platform automates the deployment process. However, ensuring optimal resource allocation is essential for smooth operations.
Rathnam Makam - PeerSpot reviewer
A no-code tool that helps to build machine learning models
One challenge I encountered while implementing RapidMiner was the lack of documentation. Since there aren't as many users, finding resources to learn the tool was initially difficult. To overcome this hurdle, I believe RapidMiner could improve by providing more tutorials tailored for new users. I haven't explored the tool's latest version, so I'm unaware of the current features. However, I think it would be beneficial if they could enhance capabilities related to deep neural networks, provide better support for generating UI, and allow for importing and utilizing large language models.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Capability analysis is one of the main and valuable functions. We also do some hypothesis testing in Minitab and summary stats. These are the functions that we find very useful."
"The best part is that they have an algorithm handbook, so you can open it up and understand how it works, and if it is useful, this is very important."
"IBM SPSS Statistics depends on AI."
"Most of the product features are good but I particularly like the linear regression analysis."
"You can find a complete algorithm in the solution and use it. You don't need to write your own algorithms for predictive analytics. That's the most valuable feature and the main one we use."
"The most valuable feature is its robust statistical analysis capabilities."
"The software offers consistency across multiple research projects helping us with predictive analytics capabilities."
"In terms of the features I've found most valuable, I'd say the duration, the correlation, and of course the nonparametric statistics. I use it for reliability and survival analysis, time series, regression models in different solutions, and different types of solutions."
"The Cloudera Data Science Workbench is customizable and easy to use."
"I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don't interfere with each other. The deployment of machine learning is fast and easy to manage. Its API calls are also fast."
"The data science, collaboration, and IDN are very, very strong."
"The most valuable features are the Binary classification and Auto Model."
"RapidMiner for Windows is an excellent graphical tool for data science."
"The documentation for this solution is very good, where each operator is explained with how to use it."
"Scalability is not really a concern with RapidMiner. It scales very well and can be used in global implementations."
"Using the GUI, I can have models and algorithms drag and drop nodes."
"The solution is stable."
"We value the collaboration and governance features because it's a comprehensive platform that covers everything from data extraction to modeling operations in the ML language. RapidMiner is competitive in the ML space."
 

Cons

"There is a learning curve; it's not very steep, but there is one."
"SPSS slows down the computer or the laptop if the data is huge; then you need a faster computer."
"It could provide even more in the way of automation as there are many opportunities."
"I would like SPSS to improve its integration with other data-filing IBM tools. I also think its duration with data, utilization, and graphics could be better."
"It would be helpful if there was better documentation on how to properly use the solution. A beginner's guide on how to use the various programming functions within the product would be so useful to a lot of people. I found that everything was very confusing at first. Having clear documentation would help alleviate that."
"I know that SPSS is a statistical tool but it should also include a little bit of analytical behavior. You can call it augmented analysis or predictive analysis. The bottom line is it should have more graphical and analytical capabilities."
"The reports could be better."
"Each algorithm could be more adaptable to some industry-specific areas, or, in some cases, adapted for maintenance."
"The tool's MLOps is not good. It's pricing also needs to improve."
"Running this solution requires a minimum of 12GB to 16GB of RAM."
"The product must provide data-cleaning features."
"Improve the online data services."
"It would be helpful to have some tutorials on communicating with Python."
"The visual interface could use something like the-drag-and-drop features which other products already support. Some additional features can make RapidMiner a better tool and maybe more competitive."
"I would appreciate improvements in automation and customization options to further streamline processes."
"Many things in the interface look nice, but they aren't of much use to the operator. It already has lots of variables in there."
"If they could include video tutorials, people would find that quite helpful."
"The price of this solution should be improved."
 

Pricing and Cost Advice

"The price of this solution is a little bit high, which was a problem for my company."
"The price of IBM SPSS Statistics could improve."
"I rate the tool's pricing a five out of ten."
"More affordable training for new staff members."
"Our licence is on a yearly renewal basis. While pricing is not the primary concern in our evaluation, as products are assessed by whether they can meet our user needs and expertise, the cost can be a limiting factor in the number of licences we procure."
"While the pricing of the product may be higher, the accompanying service and features justify the investment."
"We think that IBM SPSS is expensive for this function."
"If it requires lot of data processing, maybe switching to IBM SPSS Clementine would be better for the buyer."
"The product is expensive."
"I'm not fully aware of RapidMiner's price because we had licenses provided, but from my analysis, it's moderately priced, not too high or too low. It's worth the investment."
"The client only has to pay the licensing costs. There are not any maintenance or hidden costs in addition to the license."
"For the university, the cost of the solution is free for the students and teachers."
"I used an educational license for this solution, which is available free of charge."
"Although we don't pay licensing fees because it is being used within the university, my understanding is that the cost is between $5,000 and $10,000 USD per year."
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
824,067 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
Computer Software Company
9%
University
8%
Manufacturing Company
8%
Financial Services Firm
36%
Manufacturing Company
12%
Healthcare Company
9%
Computer Software Company
6%
University
12%
Computer Software Company
10%
Educational Organization
10%
Financial Services Firm
10%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about IBM SPSS Statistics?
The software offers consistency across multiple research projects helping us with predictive analytics capabilities.
What is your experience regarding pricing and costs for IBM SPSS Statistics?
The cost of IBM SPSS Statistics is managed by organizations, not individual researchers. It is a very expensive produ...
What needs improvement with IBM SPSS Statistics?
IBM SPSS Statistics does not keep you close to your data like KNIME. In KNIME, at every stage, you can see the result...
What do you like most about Cloudera Data Science Workbench?
I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don'...
What needs improvement with Cloudera Data Science Workbench?
The tool's MLOps is not good. It's pricing also needs to improve.
What is your primary use case for Cloudera Data Science Workbench?
We have different use cases. Our banking use case uses machine learning to identify customer life events and recommen...
What do you like most about RapidMiner?
RapidMiner is a no-code machine learning tool. I can install it on my local machine and work with smaller datasets. I...
What is your experience regarding pricing and costs for RapidMiner?
I'm not fully aware of RapidMiner's price because we had licenses provided, but from my analysis, it's moderately pri...
What needs improvement with RapidMiner?
The product must provide data-cleaning features. I could not use RapidMiner for data cleaning in one of my projects a...
 

Also Known As

SPSS Statistics
CDSW
No data available
 

Learn More

Video not available
 

Overview

 

Sample Customers

LDB Group, RightShip, Tennessee Highway Patrol, Capgemini Consulting, TEAC Corporation, Ironside, nViso SA, Razorsight, Si.mobil, University Hospitals of Leicester, CROOZ Inc., GFS Fundraising Solutions, Nedbank Ltd., IDS-TILDA
IQVIA, Rush University Medical Center, Western Union
PayPal, Deloitte, eBay, Cisco, Miele, Volkswagen
Find out what your peers are saying about Cloudera Data Science Workbench vs. RapidMiner and other solutions. Updated: December 2024.
824,067 professionals have used our research since 2012.