Try our new research platform with insights from 80,000+ expert users

Cloudera Data Science Workbench vs KNIME comparison

Sponsored
 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

IBM SPSS Statistics
Sponsored
Ranking in Data Science Platforms
10th
Average Rating
8.0
Number of Reviews
37
Ranking in other categories
Data Mining (3rd)
Cloudera Data Science Workb...
Ranking in Data Science Platforms
21st
Average Rating
7.0
Number of Reviews
2
Ranking in other categories
No ranking in other categories
KNIME
Ranking in Data Science Platforms
2nd
Average Rating
8.2
Reviews Sentiment
7.2
Number of Reviews
58
Ranking in other categories
Data Mining (1st)
 

Mindshare comparison

As of November 2024, in the Data Science Platforms category, the mindshare of IBM SPSS Statistics is 2.8%, up from 2.6% compared to the previous year. The mindshare of Cloudera Data Science Workbench is 1.5%, down from 1.8% compared to the previous year. The mindshare of KNIME is 11.0%, up from 8.6% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Science Platforms
 

Featured Reviews

AbakarAhmat - PeerSpot reviewer
Enhancing survey analysis that provides valued insightfulness
I used traditional tools where I would prepare data, click through menus, and use SQL Server for data visualization. We switched to IBM SPSS because it offers strong certification and aligns well with the standards we prioritize in our surveys. In terms of popularity, it stands out as the top choice in the market, especially in the research and university domains. Many different organizations and institutions use SPSS for statistical analytics. While there are other tools like MCLab and similar options available, SPSS is the most renowned and widely used among them.
Ismail Peer - PeerSpot reviewer
Useful for data science modeling but improvement is needed in MLOps and pricing
If you don't configure CDSW well, then it might be not useful for you. Deploying the tool can vary in complexity, but most of the time, it's relatively simple and straightforward. Triggering a job from data to production is easy, as the platform automates the deployment process. However, ensuring optimal resource allocation is essential for smooth operations.
Hansong Choi - PeerSpot reviewer
A low-code platform that reduces data mining time by linking script
The solution allows for sharing model designs and model operations with other data analysts. Other solutions such as SAS, R, and Python consist of just the script which is difficult to share. The solution is a low-code platform which reduces data mining time and its platform includes a clickable icon for linking script.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"It is a modeling tool with helpful automation."
"It has helped our analyst unit deliver work with more transparency and confidence, given that we can always view the dataset in totality, after each step of data transformation."
"The learning curve to using this product is not steep. The program is appropriate for those who do not have a lot of background in programming, yet have to perform basic statistical analysis."
"The most valuable feature is the user interface because you don't need to write code."
"The most valuable features are the small learning curve and its ability to hold a lot of data."
"You can find a complete algorithm in the solution and use it. You don't need to write your own algorithms for predictive analytics. That's the most valuable feature and the main one we use."
"The solution has numerous valuable features. We particularly like custom tabs. It's very useful. We end up analyzing a lot of software data, so features related to custom tabs are really helpful."
"The most valuable feature is its robust statistical analysis capabilities."
"The Cloudera Data Science Workbench is customizable and easy to use."
"I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don't interfere with each other. The deployment of machine learning is fast and easy to manage. Its API calls are also fast."
"All of the features related to the ETL are fantastic. That includes the connectors to other programs, databases, and the meta node function."
"I've never had any problems with stability."
"This open-source product can compete with category leaders in ELT software."
"I've tried to utilize KNIME to the fullest extent possible to replace Excel."
"One of the greatest advantages of KNIME is that it can be used by those without any coding experience. those with no coding background can use it."
"The tool's analytic capabilities are good."
"The solution allows for sharing model designs and model operations with other data analysts."
"The most valuable features of KNIME are its ability to convert your sub-workflow into a node. For example, the workflow has many individual native nodes that can be converted into a single node. This representation has simplified my workflow to a great extent. I can present my workflow in a very compact way."
 

Cons

"The reports could be better."
"There is a learning curve; it's not very steep, but there is one."
"Perhaps in terms of visualization. It's not really easy to do some data visualization, just simple, descriptive analysis in SPSS. I think that could be an area for improvement."
"I would like SPSS to improve its integration with other data-filing IBM tools. I also think its duration with data, utilization, and graphics could be better."
"Technical support needs some improvement, as they do not respond as quickly as we would like."
"I feel that when it comes to conducting multiple analyses, there could be more detailed information provided. Currently, the software gives a summary and an overview, but it would be beneficial to have specific details for each product or variable."
"The solution needs more planning tools and capabilities."
"Better documentation on how to use macros."
"Running this solution requires a minimum of 12GB to 16GB of RAM."
"The tool's MLOps is not good. It's pricing also needs to improve."
"The documentation is lacking and it could be better."
"The dynamic column name feature could be improved. When attempting to automate processes involving columns, such as with companies, it becomes difficult to achieve the same result when we make changes."
"When deploying models on a regular system, it works fine. However, when accuracy is a priority, hyperparameter tuning is necessary. Currently, KNIME doesn't have the best tools for this which they could improve in this area."
"The predefined workflows could use a bit of improvement."
"It could input more data acquisitions from other sources and it is difficult to combine with Python."
"KNIME's documentation is not strong."
"There are a lot of tools in the product and it would help if they were grouped into classes where you can select a function, rather than a specific tool."
"​The data visualization part is the area most in need of improvement."
 

Pricing and Cost Advice

"The price of this solution is a little bit high, which was a problem for my company."
"More affordable training for new staff members."
"SPSS is an expensive piece of software because it's incredibly complex and has been refined over decades, but I would say it's fairly priced."
"I rate the tool's pricing a five out of ten."
"Our licence is on a yearly renewal basis. While pricing is not the primary concern in our evaluation, as products are assessed by whether they can meet our user needs and expertise, the cost can be a limiting factor in the number of licences we procure."
"We think that IBM SPSS is expensive for this function."
"If it requires lot of data processing, maybe switching to IBM SPSS Clementine would be better for the buyer."
"The pricing of the modeler is high and can reduce the utility of the product for those who can not afford to adopt it."
"The product is expensive."
"There is a Community Edition and paid versions available."
"Scaling to the on-premises version requires a licensing fee per user that is a bit expensive in comparison to R, Python, and SAS."
"KNIME is a cheap product. I currently use KNIME's open-source version."
"For beginners, the free desktop version is very attractive, but the full server version can be more expensive. I have only used the free version and it offers a fair pricing system. I have been promoting it to others without any compensation or request from the company, simply because I am enthusiastic about it. I am not aware of the pricing for the server version, but it seems to be widely used."
"KNIME desktop is free, which is great for analytics teams. Server is well priced, depending on how much support is required."
"The price for Knime is okay."
"KNIME is free as a stand-alone desktop-based platform but if you want to get a KNIME server then you can find the cost on their website."
"This is an open-source solution that is free to use."
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
816,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
University
9%
Computer Software Company
9%
Manufacturing Company
8%
Financial Services Firm
35%
Manufacturing Company
11%
Healthcare Company
9%
Government
7%
Manufacturing Company
13%
Financial Services Firm
13%
Computer Software Company
9%
Educational Organization
8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about IBM SPSS Statistics?
The software offers consistency across multiple research projects helping us with predictive analytics capabilities.
What is your experience regarding pricing and costs for IBM SPSS Statistics?
The cost of IBM SPSS Statistics is managed by organizations, not individual researchers. It is a very expensive produ...
What needs improvement with IBM SPSS Statistics?
IBM SPSS Statistics does not keep you close to your data like KNIME. In KNIME, at every stage, you can see the result...
What do you like most about Cloudera Data Science Workbench?
I appreciate CDSW's ability to logically segregate environments, such as data, DR, and production, ensuring they don'...
What needs improvement with Cloudera Data Science Workbench?
The tool's MLOps is not good. It's pricing also needs to improve.
What is your primary use case for Cloudera Data Science Workbench?
We have different use cases. Our banking use case uses machine learning to identify customer life events and recommen...
What do you like most about KNIME?
Since KNIME is a no-code platform, it is easy to work with.
What is your experience regarding pricing and costs for KNIME?
I rate the product’s pricing a seven out of ten, where one is cheap and ten is expensive.
What needs improvement with KNIME?
The current UI is primarily in English. Analyzing data in local languages might present challenges or issues.
 

Also Known As

SPSS Statistics
CDSW
KNIME Analytics Platform
 

Learn More

Video not available
 

Overview

 

Sample Customers

LDB Group, RightShip, Tennessee Highway Patrol, Capgemini Consulting, TEAC Corporation, Ironside, nViso SA, Razorsight, Si.mobil, University Hospitals of Leicester, CROOZ Inc., GFS Fundraising Solutions, Nedbank Ltd., IDS-TILDA
IQVIA, Rush University Medical Center, Western Union
Infocom Corporation, Dymatrix Consulting Group, Soluzione Informatiche, MMI Agency, Estanislao Training and Solutions, Vialis AG
Find out what your peers are saying about Cloudera Data Science Workbench vs. KNIME and other solutions. Updated: October 2024.
816,406 professionals have used our research since 2012.