Altair RapidMiner vs Dremio comparison

100% willing to recommend

Altair RapidMiner

Comparison Buyer's Guide

Executive Summary

Updated on Mar 4, 2025

Altair RapidMiner and Dremio are competitive products in the data analytics space. Altair RapidMiner's pricing and support seem more favorable, whereas Dremio's superior features justify its higher cost.

Features: Altair RapidMiner is known for its intuitive visual workflow design, extensive machine learning capabilities, and efficient data processing. Dremio is recognized for its data-as-a-service functionality, accelerated query performance, and ability to easily integrate with various data sources.

Room for Improvement: Altair RapidMiner could expand its cloud capabilities, enhance scalability for large data sets, and improve integration with external APIs. Dremio might benefit from simplifying its learning curve, offering more user-friendly documentation, and refining its customer onboarding process.

Ease of Deployment and Customer Service: Altair RapidMiner features an easy installation process and responsive support services. Dremio focuses on cloud-centric deployment, offering scalability with some complexity, and provides robust support for users experienced in handling large data environments.

Pricing and ROI: Altair RapidMiner presents a competitive setup cost with reasonable ROI, making it accessible for various businesses. Dremio's setup costs are higher, but its advanced features potentially offer significant ROI for data-intensive operations, providing long-term benefits.

To learn more, read our detailed Altair RapidMiner vs. Dremio Report (Updated: March 2025).

Categories and Ranking

Altair RapidMiner

Ranking in Data Science Platforms: 7th
Average Rating: 8.6
Reviews Sentiment: 7.0
Number of Reviews
Ranking in other categories: Predictive Analytics (3rd)

Dremio

Ranking in Data Science Platforms: 9th
Average Rating: 8.6
Reviews Sentiment: 7.1
Number of Reviews
Ranking in other categories: Cloud Data Warehouse (9th)

Mindshare comparison

As of April 2025, in the Data Science Platforms category, the mindshare of Altair RapidMiner is 7.7%, up from 6.5% the previous year. The mindshare of Dremio is 4.3%, up from 2.8% the previous year. Mindshare is calculated from PeerSpot user engagement data.
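The mindshare figures can be read two ways: as a change in percentage points, or as growth relative to the previous year's share. The short Python sketch below works through both using only the four percentages quoted above. The function name `mindshare_change` is introduced here for illustration and is not part of any PeerSpot tooling.

```python
def mindshare_change(previous_pct, current_pct):
    """Return (percentage-point change, relative change in percent)."""
    point_change = current_pct - previous_pct
    relative_change = point_change / previous_pct * 100
    return point_change, relative_change


# (previous year, current) mindshare percentages quoted in the text above.
mindshare = {
    "Altair RapidMiner": (6.5, 7.7),
    "Dremio": (2.8, 4.3),
}

changes = {
    name: mindshare_change(prev, curr)
    for name, (prev, curr) in mindshare.items()
}

for name, (points, relative) in changes.items():
    print(f"{name}: {points:+.1f} points, {relative:+.1f}% relative")
```

On these figures, Altair RapidMiner gains about 1.2 points (roughly 18% relative growth), while Dremio gains about 1.5 points from a smaller base (roughly 54% relative growth).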

Featured Reviews

Laurence Moseley

Emeritus Professor of Health Services Research at University of South Wales

Offers good tutorials that make it easy to learn and use, with a powerful feature to compare machine learning algorithms

When I started using RapidMiner, I found it difficult to get it to read the metadata. I wanted to use, for example, a pivot table, and it did not have the variable or the attribute names in it. There were no values. It took a long while to figure out how to do that, although it tends to do it automatically nowadays. RapidMiner is not utterly intuitive for beginners. Sometimes people have trouble distinguishing between a file in their own file system and a repository entry, and they cannot find their data. This is an area where this solution could be improved.

It would be helpful to have some tutorials on communicating with Python. I found it a bit difficult at times to figure out which particular variable, or attribute, is going where in Python. It is probably a simple thing to do but I haven't mastered it yet. I'd like them to do a video on that. There are a large number of videos that are usually well-produced, but I don't think that they have one on that. Essentially, I would like to see how to communicate from RapidMiner to Python and from Python to RapidMiner.

One of the things I do a lot of is looking at questionnaires where people have used Likert-type scales. I don't recommend Likert-type scales, but if they're properly produced, which is a lot of hard work and it's not usually done, they're really powerful and you can do things like normalizing holes on the Likert scale. That's not the same as normalizing your data in RapidMiner. So, I would want to get results with these Likert scales, pass it through RapidMiner, do a normalization and pass back both the raw scores and the normalized scores and put in some rules, which will say if it's high on the raw score and on the normalized score and low on the standard deviation, then you can trust it.
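The trust rule the reviewer describes (high raw score, high normalized score, low standard deviation) can be sketched in plain Python. This is an illustration only, not the reviewer's actual method and not a RapidMiner operator. The review does not say which normalization is meant, so the sketch assumes each respondent's answers are standardized against that respondent's own mean and spread. The function names, the sample data, and the thresholds `raw_min`, `norm_min`, and `sd_max` are all hypothetical.

```python
from statistics import mean, pstdev


def standardize_respondent(answers):
    """Z-score one respondent's answers against their own mean and spread.

    Assumed normalization; the review does not specify one.
    """
    mu = mean(answers)
    sigma = pstdev(answers)
    if sigma == 0:
        # A respondent who gave the same answer everywhere carries no contrast.
        return [0.0 for _ in answers]
    return [(a - mu) / sigma for a in answers]


def score_items(responses, raw_min=4.0, norm_min=0.5, sd_max=1.0):
    """Score each questionnaire item across respondents.

    responses: one row per respondent, one column per item (e.g. 1-5 scale).
    The thresholds are hypothetical and would need tuning per study.
    """
    normalized = [standardize_respondent(row) for row in responses]
    results = []
    for item in range(len(responses[0])):
        raw_col = [row[item] for row in responses]
        norm_col = [row[item] for row in normalized]
        raw_mean = mean(raw_col)
        norm_mean = mean(norm_col)
        sd = pstdev(raw_col)
        results.append({
            "item": item,
            "raw_mean": raw_mean,
            "norm_mean": norm_mean,
            "sd": sd,
            # Trust only when all three conditions from the rule hold.
            "trusted": raw_mean >= raw_min and norm_mean >= norm_min and sd <= sd_max,
        })
    return results


# Hypothetical answers: four respondents, three items on a 1-5 scale.
responses = [
    [5, 3, 1],
    [5, 2, 2],
    [4, 3, 1],
    [5, 1, 5],
]

results = score_items(responses)
for r in results:
    print(r["item"], round(r["raw_mean"], 2), round(r["norm_mean"], 2),
          round(r["sd"], 2), r["trusted"])
```

Returning the raw mean, the normalized mean, and the standard deviation together matches the reviewer's wish to get both score types back alongside the rule's verdict.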

KamleshPant

Senior Software Architect at USEReady

Solution offers quick data connection with an edge in computation

It's almost similar, yet it's better than Starburst in spinning up or connecting to the new source since it's on SaaS. It is a similar experience between the based application and cloud-based application. You just get the source, connect the data, get visualization, get connected, and do whatever you want. They say data reflection is one way where they do the caching and all that. Starburst also does the caching. In Starburst, you have a data product. Here, the data product comes from a reflection perspective. They are working on a columnar memory map, columnar computation. That will have some edge in computation.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:

Pros

"The most valuable feature is what the product sets out to do, which is extracting information and data."

"I've been using a lot of components from the Strategic Extension and Python Extension."

"The most valuable feature of RapidMiner is that it is code free. It is similar to playing with Lego pieces and executing after you are finished to see the results. Additionally, it is easy to use and has interesting utilities when preparing the data. It has a utility to automatically launch a series of models and show the comparisons. When finished with the comparisons you can select the best one, and deploy it automatically."

"The documentation for this solution is very good, where each operator is explained with how to use it."

"RapidMiner is a no-code machine learning tool. I can install it on my local machine and work with smaller datasets. It can also connect to databases, allowing me to build models directly on the data stored there. RapidMiner offers a wider range of operators than other tools like Dataiku, making it a better option for my needs."

"Scalability is not really a concern with RapidMiner. It scales very well and can be used in global implementations."

"I like not having to write all solutions from code. Being able to drag and drop controls, enables me to focus on building the best model, without needing to search for syntax errors or extra libraries."

"The data science, collaboration, and IDN are very, very strong."

More Altair RapidMiner pros

"Dremio gives you the ability to create services which do not require additional resources and serialization."

"Dremio is very easy to use for building queries."

"Everyone uses Dremio in my company; some use it only for the analytics function."

"Dremio enables you to manage changes more effectively than any other data warehouse platform. There are two things that come into play. One is data lineage. If you are looking at data in Dremio, you may want to know the source and what happened to it along the way or how it may have been transformed in the data pipeline to get to the point where you're consuming it."

"Dremio allows querying the files I have on my block storage or object storage."

"Overall, you can rate it as eight out of ten."

"The most valuable feature of Dremio is it can sit on top of any other data storage, such as Amazon S3, Azure Data Factory, SGFS, or Hive. The memory competition is good. If you are running any kind of materialized view, you'd be running in memory."

"We primarily use Dremio to create a data framework and a data queue."

More Dremio pros

Cons

"In terms of the UI and SaaS, the user interface with KNIME is more appealing than RapidMiner."

"I would like to see all users have access to all of the deep learning models, and that they can be used easily."

"I would appreciate improvements in automation and customization options to further streamline processes."

"The biggest problem, not from a platform process, but from an avoidance process, is when you work in a heavily regulated environment, like banking and finance. Whenever you make a decision or there is an output, you need to bill it as an avoidance to the investigator or to the bank audit team. If you made decisions within this machine learning model, you need to explain why you did so. It would be better if you could explain your decision in terms of delivery. However, this is an issue with all ML platforms. Many companies are working heavily in this area to help figure out how to make it more explainable to the business team or the regulator."

"The server product has been getting updated and continues to be better each release. When I started using RapidMiner, it was solid but not easy to set up and upgrade."

"The price of this solution should be improved."

"One challenge I encountered while implementing RapidMiner was the lack of documentation. Since there aren't as many users, finding resources to learn the tool was initially difficult. To overcome this hurdle, I believe RapidMiner could improve by providing more tutorials tailored for new users."

"In the Mexican or Latin American market, it's kind of pricey."

More Altair RapidMiner cons

"We've faced a challenge with integrating Dremio and Databricks, specifically regarding authentication. It is not shaking hands very easily."

"They have an automated tool for building SQL queries, so you don't need to know SQL. That interface works, but it could be more efficient in terms of the SQL generated from those things. It's going through some growing pains. There is so much value in tools like these for people with no SQL experience. Over time, Dremio will make these capabilities more accessible to users who aren't database people."

"Dremio takes a long time to execute large queries or the executing of correlated queries or nested queries. Additionally, the solution could improve if we could read data from the streaming pipelines or if it allowed us to create the ETL pipeline directly on top of it, similar to Snowflake."

"I cannot use the recursive common table expression (CTE) in Dremio because the support page says it's currently unsupported."

"They need to have multiple connectors. Starburst is rich in connectors, however, they are lacking Salesforce connectivity as of today."

"There are performance issues at times due to our limited experience with Dremio, and the fact that we are running it on single nodes using a community version."

"Dremio doesn't support the Delta connector. Dremio writes the IT support for Delta, but the support isn't great. There is definitely room for improvement."

More Dremio cons

Pricing and Cost Advice

"I used an educational license for this solution, which is available free of charge."

"I'm not fully aware of RapidMiner's price because we had licenses provided, but from my analysis, it's moderately priced, not too high or too low. It's worth the investment."

"The client only has to pay the licensing costs. There are not any maintenance or hidden costs in addition to the license."

"For the university, the cost of the solution is free for the students and teachers."

"Although we don't pay licensing fees because it is being used within the university, my understanding is that the cost is between $5,000 and $10,000 USD per year."

"Right now the cluster costs approximately $200,000 per month and is based on the volume of data we have."

"Dremio is less costly competitively to Snowflake or any other tool."

Top Industries

By visitors reading reviews

University

11%

Computer Software Company

11%

Educational Organization

10%

Financial Services Firm

32%

Computer Software Company

10%

Manufacturing Company

Healthcare Company

Company Size

By reviewers

Large Enterprise

Midsize Enterprise

Small Business

Questions from the Community

What do you like most about RapidMiner?

RapidMiner is a no-code machine learning tool. I can install it on my local machine and work with smaller datasets. It can also connect to databases, allowing me to build models directly on the dat...

What is your experience regarding pricing and costs for RapidMiner?

I'm not fully aware of RapidMiner's price because we had licenses provided, but from my analysis, it's moderately priced, not too high or too low. It's worth the investment.

What needs improvement with RapidMiner?

Altair RapidMiner needs updates to its examples, particularly in business and marketing areas, and to the tool itself. The user interface should be improved. Incorporating generative AI as an AI as...

What do you like most about Dremio?

Dremio allows querying the files I have on my block storage or object storage.

What is your experience regarding pricing and costs for Dremio?

The licensing is very expensive. We need a license to scale as we are currently using the community version.

What needs improvement with Dremio?

They need to have multiple connectors. Starburst is rich in connectors, however, they are lacking Salesforce connectivity as of today. They don't have Salesforce connectivity. However, Starburst do...

Comparisons

KNIME vs Altair RapidMiner

Compared 52% of the time

Dataiku vs Altair RapidMiner

Compared 16% of the time

Alteryx vs Altair RapidMiner

Compared 8% of the time

Tableau vs Altair RapidMiner

Compared 7% of the time

Microsoft Azure Machine Learning Studio vs Altair RapidMiner

Compared 3% of the time

Databricks vs Dremio

Compared 51% of the time

Snowflake vs Dremio

Compared 15% of the time

Starburst Enterprise vs Dremio

Compared 6% of the time

Microsoft Power BI vs Dremio

Compared 4% of the time

Dataiku vs Dremio

Compared 4% of the time

Overview

Altair RapidMiner is a leading platform for data science and machine learning, offering a user-friendly interface with powerful tools for predictive analytics. It supports integration with APIs, Python, and cloud services for streamlined workflow creation.

RapidMiner provides an efficient data science environment featuring drag-and-drop functionality, automation tools, and a wide array of algorithms, making it adaptable for novices and experts alike. Users benefit from easy data preparation and analysis alongside robust support from a vibrant community. Users cite onboarding and deep learning model accessibility as areas for improvement, and call for enhanced image processing and large language model integration.

What features make Altair RapidMiner stand out?

Data science tools: Facilitates machine learning and predictive analytics.
Code-optional GUI: Enables non-programmers to create workflows seamlessly.
Integration: Supports APIs, Python, and cloud services.
Drag-and-drop functionality: Simplifies workflow creation for users.
Algorithm variety: Offers diverse methods for data analysis.

What benefits and ROI should users consider when evaluating?

Efficiency: Saves time in data preparation and workflow automation.
Accessibility: Intuitive design aids beginners and experts.
Community: Access to extensions and collaborative resources.
Documentation: Comprehensive tutorials enhance learning.

Altair RapidMiner is extensively used in business and academia, facilitating tasks like predictive analytics, segmentation, and deployment. In education, it supports data science teaching and research, while in industries such as telecom, banking, and healthcare, it's used for data mining, decision trees, and market analysis.

Dremio is a data analytics platform designed to simplify and expedite the data analysis process by enabling direct querying across multiple data sources without the need for data replication. This solution stands out due to its approach to data lake transformation, offering tools that allow users to access and query data stored in various formats and locations as if it were all in a single relational database.

At its core, Dremio facilitates a more streamlined data management experience. It integrates easily with existing data lakes, allowing organizations to continue using their storage of choice, such as AWS S3, Microsoft ADLS, or Hadoop, without data migration. Dremio supports SQL queries, which means it seamlessly integrates with familiar BI tools and data science frameworks, enhancing user accessibility and reducing the learning curve typically associated with adopting new data technologies.

What Are Dremio's Key Features?

Data Reflections: Reduces query times by creating optimized representations of source data, which can accelerate performance without the complexity of traditional data warehousing solutions.
Semantic Layer: Allows users to define business metrics and dimensions centrally, ensuring consistency and governance across all analytics tools.
Built-in Security Features: Provides robust security measures, including column- and row-level security, ensuring compliance with data governance and privacy standards.
Support for Multiple Data Formats and Sources: Enables querying directly against a variety of data formats (Parquet, JSON, etc.) and sources without the need for conversion or replication.

What Benefits Should Users Expect?

When evaluating Dremio, potential users should look for feedback on its query performance, especially in environments with large and complex data sets. Reviews might highlight the efficiency gains from using Dremio’s data reflections and its ability to integrate with existing BI tools without significant changes to underlying data structures. Also, check how other users evaluate its ease of deployment and scalability, particularly in hybrid and cloud environments.

How is Dremio Implemented Across Different Industries?

Dremio is widely applicable across various industries, including finance, healthcare, and retail, where organizations benefit from rapid, on-demand access to large volumes of data spread across disparate systems. For instance, in healthcare, Dremio can be used to analyze patient outcomes across different data repositories, improving treatment strategies and operational efficiencies.

What About Dremio’s Pricing, Licensing, and Support?

Dremio offers a flexible pricing model that caters to different sizes and types of businesses, including a free community version for smaller teams and proof-of-concept projects. Their enterprise version is subscription-based, with pricing varying based on the deployment scale and support needs. Customer support is comprehensive, featuring dedicated assistance, online resources, and community support.

Sample Customers

Altair RapidMiner: PayPal, Deloitte, eBay, Cisco, Miele, Volkswagen

Dremio: UBS, TransUnion, Quantium, Daimler, OVH