Dataiku vs RapidMiner comparison

 

Comparison Buyer's Guide

Executive Summary

Updated on Dec 5, 2024

Categories and Ranking

Dataiku
Ranking in Data Science Platforms: 7th
Average Rating: 8.0
Reviews Sentiment: 7.2
Number of Reviews: 10
Ranking in other categories: None

RapidMiner
Ranking in Data Science Platforms: 6th
Average Rating: 8.6
Reviews Sentiment: 7.0
Number of Reviews: 22
Ranking in other categories: Predictive Analytics (3rd)
 

Mindshare comparison

As of February 2025, Dataiku holds a 12.4% mindshare in the Data Science Platforms category, up from 7.9% a year earlier. RapidMiner holds 7.7%, up from 6.1% a year earlier. Mindshare is calculated from PeerSpot user engagement data.
 

Featured Reviews

RichardXu - PeerSpot reviewer
The platform organizes workflows visually and efficiently
One of the valuable features of Dataiku is the workflow capability. It allows us to organize a workflow efficiently. The platform has a visual interface, making it much easier for educated professionals to organize their work. This feature is useful because it simplifies tasks and eliminates the need for a data scientist. If you are knowledgeable about AI, you can directly write using primitive tools like TensorFlow, PyTorch, and Scikit-learn. However, Dataiku makes this process much easier.
Rathnam Makam - PeerSpot reviewer
A no-code tool that helps to build machine learning models
One challenge I encountered while implementing RapidMiner was the lack of documentation. Since there aren't as many users, finding resources to learn the tool was initially difficult. To overcome this hurdle, I believe RapidMiner could improve by providing more tutorials tailored for new users. I haven't explored the tool's latest version, so I'm unaware of the current features. However, I think it would be beneficial if they could enhance capabilities related to deep neural networks, provide better support for generating UI, and allow for importing and utilizing large language models.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Traceability is vital since I manage many cohorts, and collaboration is key as I have multiple engineers substituting for one another."
"Cloud-based process run helps in not keeping the systems on while processes are running."
"The most valuable feature of this solution is that it is one tool that can do everything, and you have the ability to very easily push your design to prediction."
"The most valuable feature is the set of visual data preparation tools."
"I believe the return on investment looks positive."
"One of the valuable features of Dataiku is the workflow capability."
"Extremely easy to use with its GUI-based functionality and large compatibility with various data sources. Also, maintenance processes are much more automated than ever, with fewer errors."
"If many teams are collaborating and sharing Jupyter notebooks, it's very useful."
"RapidMiner for Windows is an excellent graphical tool for data science."
"Using the GUI, I can have models and algorithms drag and drop nodes."
"RapidMiner is a no-code machine learning tool. I can install it on my local machine and work with smaller datasets. It can also connect to databases, allowing me to build models directly on the data stored there. RapidMiner offers a wider range of operators than other tools like Dataiku, making it a better option for my needs."
"RapidMiner is very easy to use."
"The most valuable features are the Binary classification and Auto Model."
"We value the collaboration and governance features because it's a comprehensive platform that covers everything from data extraction to modeling operations in the ML language. RapidMiner is competitive in the ML space."
"The best part of RapidMiner is efficiency."
"The solution is stable."
 

Cons

"Although known for Big Data, the processing time to process 1.8 billion records was terribly slow (five days)."
"One of the main challenges was collaboration. Developers typically use GitHub to push and manage code, but integrating GitHub with Dataiku was complicated."
"Server up-time needs to be improved. Also, query engines like Spark and Hive need to be more stable."
"The ability to have charts right from the explorer would be an improvement."
"The interface for the web app can be a bit difficult. It needs to have better capabilities, at least for developers who like to code. This is due to the fact that everything is enabled in a single window with different tabs. For them to actually develop and do the concurrent testing that needs to be done, it takes a bit of time. That is one improvement that I would like to see - from a web app developer perspective."
"I think it would help if Data Science Studio added some more features and improved the data model."
"I find that it is a little slow during use. It takes more time than I would expect for operations to complete."
"One area for improvement is the need for more capabilities similar to those provided by NVIDIA for parallel machine learning training. We still encounter some integration issues."
"It would be helpful to have some tutorials on communicating with Python."
"A great product but confusing in some way with regard to the user interface and integration with other tools."
"In the Mexican or Latin American market, it's kind of pricey."
"If they could include video tutorials, people would find that quite helpful."
"The product must provide data-cleaning features."
"I think that they should make deep learning models easier."
"The biggest problem, not from a platform process, but from an avoidance process, is when you work in a heavily regulated environment, like banking and finance. Whenever you make a decision or there is an output, you need to bill it as an avoidance to the investigator or to the bank audit team. If you made decisions within this machine learning model, you need to explain why you did so. It would better if you could explain your decision in terms of delivery. However, this is an issue with all ML platforms. Many companies are working heavily in this area to help figure out how to make it more explainable to the business team or the regulator."
"RapidMiner would be improved with the inclusion of more machine learning algorithms for generating time-series forecasting models."
 

Pricing and Cost Advice

"Pricing is pretty steep. Dataiku is also not that cheap."
"The annual licensing fees are approximately €20 ($22 USD) per key for the basic version and €40 ($44 USD) per key for the version with everything."
"For the university, the cost of the solution is free for the students and teachers."
"I used an educational license for this solution, which is available free of charge."
"Although we don't pay licensing fees because it is being used within the university, my understanding is that the cost is between $5,000 and $10,000 USD per year."
"The client only has to pay the licensing costs. There are not any maintenance or hidden costs in addition to the license."
"I'm not fully aware of RapidMiner's price because we had licenses provided, but from my analysis, it's moderately priced, not too high or too low. It's worth the investment."
 

Top Industries

By visitors reading reviews
Dataiku
Financial Services Firm: 18%
Educational Organization: 15%
Manufacturing Company: 9%
Computer Software Company: 8%

RapidMiner
University: 11%
Computer Software Company: 11%
Financial Services Firm: 10%
Educational Organization: 10%
 

Company Size

By reviewers (chart): Large Enterprise, Midsize Enterprise, Small Business
 

Questions from the Community

What needs improvement with Dataiku Data Science Studio?
I need more experience in the sector, which is health. The license is very expensive. It would be great to have an intermediate license for basic treatments that do not require extensive experience.
What is your primary use case for Dataiku Data Science Studio?
I use Dataiku since I am preparing cohorts for health investment research.
What do you like most about RapidMiner?
RapidMiner is a no-code machine learning tool. I can install it on my local machine and work with smaller datasets. It can also connect to databases, allowing me to build models directly on the dat...
What is your experience regarding pricing and costs for RapidMiner?
I'm not fully aware of RapidMiner's price because we had licenses provided, but from my analysis, it's moderately priced, not too high or too low. It's worth the investment.
What needs improvement with RapidMiner?
The product must provide data-cleaning features. I could not use RapidMiner for data cleaning in one of my projects and had to use Python instead.
 

Also Known As

Dataiku: Dataiku DSS
RapidMiner: No data available
 

Sample Customers

Dataiku: BGL BNP Paribas, Dentsu Aegis, Link Mobility Group, AramisAuto
RapidMiner: PayPal, Deloitte, eBay, Cisco, Miele, Volkswagen
Find out what your peers are saying about Dataiku vs. RapidMiner and other solutions. Updated: January 2025.
838,533 professionals have used our research since 2012.