Try our new research platform with insights from 80,000+ expert users

RapidMiner vs Talend Data Quality comparison

 

Comparison Buyer's Guide

Executive Summary
 

Categories and Ranking

RapidMiner
Average Rating
8.6
Number of Reviews
22
Ranking in other categories
Predictive Analytics (3rd), Data Science Platforms (6th)
Talend Data Quality
Average Rating
8.0
Number of Reviews
20
Ranking in other categories
Data Quality (6th), Data Scrubbing Software (1st)
 

Mindshare comparison

RapidMiner and Talend Data Quality aren’t in the same category and serve different purposes. RapidMiner is designed for Predictive Analytics and holds a mindshare of 17.3%, down 20.7% compared to last year.
Talend Data Quality, on the other hand, focuses on Data Quality, holds 3.7% mindshare, down 6.5% since last year.
Predictive Analytics
Data Quality
 

Featured Reviews

Rathnam Makam - PeerSpot reviewer
May 7, 2024
A no-code tool that helps to build machine learning models
I use the tool for educational purposes to mentor students. I use it in various educational projects and real-world customer use cases. It helps me to build machine learning models such as clustering, decision trees, and time series analysis RapidMiner is a no-code machine learning tool. I can…
WesamHabboub - PeerSpot reviewer
Jan 16, 2024
Stands out for its user-friendly interface, robust community support, competitive pricing and strategic approach to improving data accuracy
Its greatest asset lies in its user-friendly interface, specifically within the Talend Open Studio, known for its ease of use and familiarity among users. The robust community support proves invaluable when encountering challenges, providing a reliable resource for issue resolution. Moreover, the pricing structure stands out as highly competitive compared to other offerings in the market, making it a cost-effective choice for users. The most valuable feature lies in the capability to assign data quality issues to different stakeholders, facilitating the tracking and resolution of defective work. This functionality enables a streamlined process for identifying, assigning, and subsequently addressing data quality issues.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The data science, collaboration, and IDN are very, very strong."
"The most valuable feature of RapidMiner is that it is code free. It is similar to playing with Lego pieces and executing after you are finished to see the results. Additionally, it is easy to use and has interesting utilities when preparing the data. It has a utility to automatically launch a series of models and show the comparisons. When finished with the comparisons you can select the best one, and deploy it automatically."
"The most valuable feature is what the product sets out to do, which is extracting information and data."
"RapidMiner is very easy to use."
"The best part of RapidMiner is efficiency."
"Using the GUI, I can have models and algorithms drag and drop nodes."
"It is easy to use and has a huge community that I can rely on for help. Moreover, it is interactive."
"We value the collaboration and governance features because it's a comprehensive platform that covers everything from data extraction to modeling operations in the ML language. RapidMiner is competitive in the ML space."
"With its frequency function, we were able to pick a line of business to be addressed first in one of our conversion projects."
"The jobs are visual and this has improved collaboration between colleagues. It’s much easier to understand a visual job than a piece of Java code."
"We are able to get emails from URLs very easily using this function when others fail."
"The features that I find to be the most valuable are the extensibility, the integration, and the ease of integration with multiple platforms."
"​This product speeds up the unit testing and QA for specific test scenarios. As a result, the development output quality can be evaluated and adjusted.​"
"The most valuable feature lies in the capability to assign data quality issues to different stakeholders, facilitating the tracking and resolution of defective work."
"The solution is customizable."
"I really like the fact that there are no out-of-the-box solutions regarding the development of jobs. Other vendors may have modules which cleanse your addresses. In Talend, you have the freedom to completely develop the process yourself. This can be tricky, but it also makes it fun."
 

Cons

"The product must provide data-cleaning features."
"A great product but confusing in some way with regard to the user interface and integration with other tools."
"The price of this solution should be improved."
"I think that they should make deep learning models easier."
"RapidMiner would be improved with the inclusion of more machine learning algorithms for generating time-series forecasting models."
"In the Mexican or Latin American market, it's kind of pricey."
"I would like to see more integration capabilities."
"Many things in the interface look nice, but they aren't of much use to the operator. It already has lots of variables in there."
"They don't have any AI capabilities. Talend DQ is specifically for data quality, which only has data profiling. With Talend DQ, I cannot generate any reports today, so I need an ETL tool. It provides general Excel files, or I have to create some views. If instead of buying a new tool, Talend provides a reporting capability or solution, it would be great. It will reduce the development effort for creating these kinds of reports. We also manage the infrastructure for Talend. From the licensing perspective, for cloud, they only have seat licenses where one person is tied to one license, but for on-premise, they have concurrent licenses. It would be really awesome if they can provide concurrent licenses for the cloud so that if one person is not there, somebody else can use that license. Currently, it is not possible unless a person deactivates his or her license and moves the same seat license to someone else. We are one of the biggest customers in the central zone of the US for Talend, and this is the feedback that we have provided them again and again, but they come back and say that they aren't able to provide concurrent licenses on the cloud. In version 7.3, there is a feature for tokenization and de-tokenization of data. This is the feature that we are looking for. It is useful if somebody wants to see what we have masked and how do we demask it. This feature is not there in version 7.1. There are also a few other capabilities on the cloud, but we don't yet have a big footprint in the cloud."
"When we upgraded to Version 6.4.1, we tried using a GIT repository instead of a SVN repository. After a few incidents where things disappeared and changes were not saved, we decided to go back to a SVN repository."
"If the SQL input controls could dynamically determine the schema-based on the SQL alone, it would simplify the steps of having to use a manually created and saved schema for use in the TMap for the Postgres and Redshift components. This would make things even easier."
"Heap space issues plague us consistently. We maxed it out and it runs fine, then it doesn’t, then it does."
"There are too many functions which could be streamlined."
"It would be more helpful if it offered dynamic dashboards that could be directly used by clients for better analysis."
"If we encounter issues, it’s most likely when using the Talend Open Studio. The studio can be slow, get stuck, or crash. But again, it can be caused by the resources of your machine or your connection with the repository. If we encounter issues with the Studio we restart the Studio. In emergencies, we create and use a new workspace."
"You can't join more than two tables for analysis."
 

Pricing and Cost Advice

"Although we don't pay licensing fees because it is being used within the university, my understanding is that the cost is between $5,000 and $10,000 USD per year."
"I'm not fully aware of RapidMiner's price because we had licenses provided, but from my analysis, it's moderately priced, not too high or too low. It's worth the investment."
"For the university, the cost of the solution is free for the students and teachers."
"The client only has to pay the licensing costs. There are not any maintenance or hidden costs in addition to the license."
"I used an educational license for this solution, which is available free of charge."
"Moreover, the pricing structure stands out as highly competitive compared to other offerings in the market, making it a cost-effective choice for users."
"It's a subscription-based platform, we renew it every year."
"I would advise to first take a look and at the Open Studio edition. Figure out what you need and purchase the appropriate license."
"We did not purchase a separate license for DQ. It is part of our data platform suite, and I believe it is well-priced."
"It is cheaper than Informatica. Talend Data Quality costs somewhere between $10,000 to $12,000 per year for a seat license. It would cost around $20,000 per year for a concurrent license. It is the same for the whole big data solution, which comes with Talend DI, Talend DQ, and TDM."
report
Use our free recommendation engine to learn which Predictive Analytics solutions are best for your needs.
814,763 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
University
11%
Computer Software Company
10%
Educational Organization
10%
Financial Services Firm
9%
Financial Services Firm
14%
Computer Software Company
12%
Manufacturing Company
10%
Energy/Utilities Company
8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about RapidMiner?
RapidMiner is a no-code machine learning tool. I can install it on my local machine and work with smaller datasets. It can also connect to databases, allowing me to build models directly on the dat...
What is your experience regarding pricing and costs for RapidMiner?
I'm not fully aware of RapidMiner's price because we had licenses provided, but from my analysis, it's moderately priced, not too high or too low. It's worth the investment.
What needs improvement with RapidMiner?
The product must provide data-cleaning features. I could not use RapidMiner for data cleaning in one of my projects and had to use Python instead.
What do you like most about Talend Data Quality?
The most valuable feature lies in the capability to assign data quality issues to different stakeholders, facilitating the tracking and resolution of defective work.
What is your experience regarding pricing and costs for Talend Data Quality?
There are many data quality tools available, but some can be expensive. Talend Data Quality stands out because it is often provided for free if you already have Talend Data Integration, which means...
What needs improvement with Talend Data Quality?
Talend suite might have a missing product, particularly in the commercial master aspect. This would contribute to completing the overall picture, though the focus isn't necessarily on economic cons...
 

Learn More

 

Overview

 

Sample Customers

PayPal, Deloitte, eBay, Cisco, Miele, Volkswagen
Aliaxis, Electrocomponents, M¾NCHENER VEREIN, The Sunset Group
Find out what your peers are saying about Alteryx, SAP, RapidMiner and others in Predictive Analytics. Updated: November 2024.
814,763 professionals have used our research since 2012.