
Dremio vs KNIME comparison

 

Comparison Buyer's Guide

Executive Summary
Updated on Dec 5, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Dremio
Ranking in Data Science Platforms: 8th
Average Rating: 8.6
Reviews Sentiment: 7.2
Number of Reviews: 7
Ranking in other categories: Cloud Data Warehouse (10th)

KNIME
Ranking in Data Science Platforms: 2nd
Average Rating: 8.0
Reviews Sentiment: 7.1
Number of Reviews: 59
Ranking in other categories: Data Mining (1st)
 

Mindshare comparison

As of January 2025, in the Data Science Platforms category, Dremio's mindshare is 4.3%, up from 2.4% a year earlier. KNIME's mindshare is 11.3%, up from 9.4% a year earlier. Mindshare is calculated from PeerSpot user engagement data.
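The year-over-year figures above can be read two ways: as a change in percentage points or as relative growth. The short sketch below works through that arithmetic using only the four numbers quoted in the paragraph; the function and variable names are illustrative and not part of any PeerSpot tooling.

```python
from typing import Tuple

# Mindshare figures quoted in the paragraph above (percent of category engagement).
mindshare = {
    "Dremio": {"previous": 2.4, "current": 4.3},
    "KNIME": {"previous": 9.4, "current": 11.3},
}


def mindshare_change(previous: float, current: float) -> Tuple[float, float]:
    """Return (percentage-point change, relative change in percent), rounded to 1 decimal."""
    point_change = round(current - previous, 1)
    relative_change = round((current - previous) / previous * 100, 1)
    return point_change, relative_change


changes = {
    name: mindshare_change(values["previous"], values["current"])
    for name, values in mindshare.items()
}

for name, (points, relative) in changes.items():
    print(f"{name}: {points:+.1f} points, {relative:+.1f}% relative")
```

Both products gained the same number of percentage points, but because Dremio started from a smaller base its relative growth is considerably larger.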
 

Featured Reviews

MikeWalker - PeerSpot reviewer
It enables you to manage changes more effectively than any other platform.
Dremio enables you to manage changes more effectively than any other data warehouse platform. Two things come into play. One is data lineage. If you are looking at data in Dremio, you may want to know its source and what happened to it along the way, or how it was transformed in the data pipeline before it reached the point where you're consuming it. The other is data provenance. The two are tied together. Data provenance allows you to go back and recreate the data as it was at any particular point in time. It's extremely important for compliance and governance because data changes all the time. How did it change? What was it three days or three months ago? You may have made some decisions based on data that was three months old, so you might need to revisit those. It's essential for things like machine learning and deep learning, where you are generating AI models from data. When a model stops working or doesn't work as expected, you need to figure out why. You have to go back and adjust the datasets used to train the model. We do that through an open-source project called Nessie, which is their basis for providing data lineage and data provenance capabilities. It's super powerful.
Arrow is another open-source project, for storing data in memory and performing query operations on it. Data sits on disk in one format. If you want to do anything with it, you have to load it into memory so you can work with it. Arrow provides an in-memory format that enables a whole library of operations on that data. Every vendor has its own way of representing data in memory. They've latched onto an industry standard and developed it so it's open. Now people can use the exact same in-memory format and the same library to perform functions on data. New developers can decide whether to develop their own memory format or use one that's already there.
Data transfer is a massive problem when you're working with large datasets, doing advanced analytics, and trying to train machine learning or deep learning models. Companies often downsample their datasets to train models because transferring and managing the full data on a deep learning or machine learning platform is too much.
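To make the reviewer's distinction between lineage and provenance concrete, here is a minimal Python sketch of a versioned dataset. It is a toy illustration only: the `VersionedDataset` class, its methods, and the sample rows are hypothetical and do not reflect how Nessie or Dremio actually implement these features.

```python
import copy
from datetime import datetime, timezone


class VersionedDataset:
    """Toy append-only history: each commit stores a full snapshot plus a note."""

    def __init__(self):
        # List of (timestamp, note, snapshot); assumed to be appended in time order.
        self._commits = []

    def commit(self, rows, note, timestamp):
        # Deep-copy so later edits to `rows` cannot rewrite history.
        self._commits.append((timestamp, note, copy.deepcopy(rows)))

    def as_of(self, timestamp):
        """Provenance: return the snapshot that was current at `timestamp`."""
        candidates = [c for c in self._commits if c[0] <= timestamp]
        if not candidates:
            raise LookupError("no data existed at that time")
        return copy.deepcopy(candidates[-1][2])

    def lineage(self):
        """Lineage: return the ordered notes describing what happened to the data."""
        return [note for _, note, _ in self._commits]


# Hypothetical example: a price is corrected three months after the initial load.
ds = VersionedDataset()
ds.commit([{"id": 1, "price": 10.0}], "loaded from source",
          datetime(2024, 9, 1, tzinfo=timezone.utc))
ds.commit([{"id": 1, "price": 12.5}], "price correction applied",
          datetime(2024, 12, 1, tzinfo=timezone.utc))

# What did the data look like when an earlier decision (or model) was made?
earlier_view = ds.as_of(datetime(2024, 9, 15, tzinfo=timezone.utc))
current_view = ds.as_of(datetime(2025, 1, 1, tzinfo=timezone.utc))
```

Storing full snapshots keeps the sketch short; a real system would more plausibly track changes incrementally.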
Shyam_Sridhar - PeerSpot reviewer
Good from data analysis to model prediction and application, but has data load limitations
KNIME is very easy to handle and use. Anyone can use it, and it's easy to learn. You don't need a specific class. They're very good at model prediction. It has got everything. From data analysis to model prediction and application, it's very good. I only use the free community edition, not the enterprise one. I feel KNIME is really good. I haven't tried any other tool or platform yet, but KNIME is pretty good. The workflow is great. You drag and drop, and then you have the data explorer and charts that give results. The execution is also good – it's easy to identify where your model has gone wrong. It shows you the exact point of error within the workflow, so you don't have to execute the entire workflow to find it. For example, if your workflow has ten steps and the error is in the sixth step, it will show you the error at that step. You don't have to worry about the first five steps. The Data Explorer is very good, and the charts are great too. The accuracy charts for different models, like decision tree, K3, Naive Bayes, are all very good. KNIME is great at reporting, whether it's structured or unstructured data. These are all very good features.
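The behavior the reviewer describes, where a ten-step workflow reports a failure at step six and the first five steps need not be rerun, can be sketched as a pipeline that caches each node's output. This is an illustrative Python model only; the `Workflow` class and the step functions are hypothetical and say nothing about KNIME's internals or API.

```python
class Workflow:
    """Toy node-by-node pipeline that caches the output of each successful node."""

    def __init__(self, steps):
        self.steps = steps  # list of (name, function) pairs
        self.cache = []     # outputs of nodes that have already succeeded

    def run(self, data):
        """Run the remaining nodes; return (result, None) or (None, failing step number)."""
        current = self.cache[-1] if self.cache else data
        # Resume after the last cached node instead of starting from the beginning.
        for index in range(len(self.cache), len(self.steps)):
            name, func = self.steps[index]
            try:
                current = func(current)
            except Exception as exc:
                print(f"Step {index + 1} ({name}) failed: {exc}")
                return None, index + 1
            self.cache.append(current)
        return current, None


# Ten hypothetical steps; the sixth is deliberately broken.
steps = [(f"add_{i}", lambda x, i=i: x + i) for i in range(1, 11)]
steps[5] = ("divide", lambda x: x / 0)

workflow = Workflow(steps)
first_result, first_failure = workflow.run(0)
cached_after_failure = len(workflow.cache)

# Repair only the failing node, then rerun: steps one to five come from the cache.
workflow.steps[5] = ("add_6", lambda x: x + 6)
final_result, final_failure = workflow.run(0)
```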

Quotes from Members

 

Pros

"Dremio enables you to manage changes more effectively than any other data warehouse platform. There are two things that come into play. One is data lineage. If you are looking at data in Dremio, you may want to know the source and what happened to it along the way or how it may have been transformed in the data pipeline to get to the point where you're consuming it."
"Dremio allows querying the files I have on my block storage or object storage."
"Everyone uses Dremio in my company; some use it only for the analytics function."
"We primarily use Dremio to create a data framework and a data queue."
"The most valuable feature of Dremio is it can sit on top of any other data storage, such as Amazon S3, Azure Data Factory, HDFS, or Hive. The in-memory computation is good. If you are running any kind of materialized view, you'd be running in memory."
"Dremio is very easy to use for building queries."
"Dremio gives you the ability to create services which do not require additional resources and serialization."
"The tool's analytic capabilities are good."
"The product is very easy to understand even for non-analytical stakeholders. Sometimes we provide them with KNIME workflows and teach them how to run it on their own machine."
"Easy to use, stable, and powerful."
"The solution is very easy to use."
"We can deploy the solution in a cluster as well."
"There are a lot of connectors available in KNIME."
"It allows for a user-friendly approach where you can simply drag and drop elements to create your model, which is a convenient and effective idea."
"KNIME is easy to learn."
 

Cons

"There are performance issues at times due to our limited experience with Dremio, and the fact that we are running it on single nodes using a community version."
"Dremio takes a long time to execute large queries, such as correlated or nested queries. Additionally, the solution could improve if we could read data from the streaming pipelines or if it allowed us to create the ETL pipeline directly on top of it, similar to Snowflake."
"Dremio doesn't support the Delta connector. Dremio says it supports Delta, but the support isn't great. There is definitely room for improvement."
"They have an automated tool for building SQL queries, so you don't need to know SQL. That interface works, but it could be more efficient in terms of the SQL generated from those things. It's going through some growing pains. There is so much value in tools like these for people with no SQL experience. Over time, Dremio will make these capabilities more accessible to users who aren't database people."
"We've faced a challenge with integrating Dremio and Databricks, specifically regarding authentication. It is not shaking hands very easily."
"I cannot use the recursive common table expression (CTE) in Dremio because the support page says it's currently unsupported."
"It shows errors sometimes."
"The pricing needs improvement."
"It needs more examples, use cases, and MOOCs to learn from, especially with respect to the algorithms and how to practically create a flow from end-to-end."
"I would prefer to have more connectivity."
"KNIME's documentation is not strong."
"When deploying models on a regular system, it works fine. However, when accuracy is a priority, hyperparameter tuning is necessary. Currently, KNIME doesn't have the best tools for this, and it could improve in this area."
"The data visualization part is the area most in need of improvement."
"KNIME can improve by adding more automation tools in the query, similar to UiPath or Blue Prism. It would make the data collection and cleanup duties more versatile."
"If they had a more structured training model it would be very helpful."
 
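One of the quotes above mentions that recursive common table expressions (CTEs) were reported as unsupported in Dremio. For readers unfamiliar with the construct, the sketch below shows what a recursive CTE does, using Python's built-in SQLite driver rather than Dremio, alongside a client-side loop that produces the same result with ordinary queries. The `employees` table and the `walk_hierarchy` helper are hypothetical, and the loop is offered as one possible workaround under those assumptions, not a documented Dremio recommendation.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (id INTEGER, name TEXT, manager_id INTEGER)")
conn.executemany(
    "INSERT INTO employees VALUES (?, ?, ?)",
    [(1, "Ana", None), (2, "Ben", 1), (3, "Cy", 2), (4, "Dee", 1)],
)

# Recursive CTE: start at the root, then repeatedly join children onto the result.
recursive_rows = conn.execute(
    """
    WITH RECURSIVE chain(id, name, depth) AS (
        SELECT id, name, 0 FROM employees WHERE manager_id IS NULL
        UNION ALL
        SELECT e.id, e.name, c.depth + 1
        FROM employees e JOIN chain c ON e.manager_id = c.id
    )
    SELECT id, name, depth FROM chain ORDER BY id
    """
).fetchall()


def walk_hierarchy(connection):
    """Client-side equivalent: one ordinary query per level, no recursive CTE.

    Assumes the hierarchy has no cycles; a cycle would keep the loop running.
    """
    rows, depth = [], 0
    frontier = connection.execute(
        "SELECT id, name FROM employees WHERE manager_id IS NULL"
    ).fetchall()
    while frontier:
        rows.extend((emp_id, name, depth) for emp_id, name in frontier)
        placeholders = ",".join("?" * len(frontier))
        frontier = connection.execute(
            f"SELECT id, name FROM employees WHERE manager_id IN ({placeholders})",
            [emp_id for emp_id, _ in frontier],
        ).fetchall()
        depth += 1
    return sorted(rows)


iterative_rows = walk_hierarchy(conn)
```

The trade-off is one round trip per hierarchy level instead of a single query, which may matter for deep hierarchies.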

Pricing and Cost Advice

"Dremio is less costly compared to Snowflake or any other tool."
"Right now the cluster costs approximately $200,000 per month and is based on the volume of data we have."
"KNIME assets are standalone, as the solution is open source."
"The price for KNIME is okay."
"KNIME is a cheap product. I currently use KNIME's open-source version."
"KNIME is free and open source."
"I use the open-source version."
"With KNIME, you can use the desktop version free of charge as much as you like. I've yet to hit its limits. If I did, I'd have to go to the server version, and for that you have to pay. Fortunately, I don't have to at the moment."
"While there are certain limitations in functionality, you can still utilize it efficiently free of charge."
"KNIME is an open-source tool, so it's free to use."
831,369 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 32%
Computer Software Company: 10%
Manufacturing Company: 8%
Retailer: 4%

Financial Services Firm: 13%
Manufacturing Company: 12%
Computer Software Company: 9%
Educational Organization: 8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Dremio?
Dremio allows querying the files I have on my block storage or object storage.
What is your experience regarding pricing and costs for Dremio?
The licensing is very expensive. We need a license to scale as we are currently using the community version.
What needs improvement with Dremio?
There are performance issues at times due to our limited experience with Dremio, and the fact that we are running it on single nodes using a community version. We face certain issues when connectin...
What do you like most about KNIME?
Since KNIME is a no-code platform, it is easy to work with.
What is your experience regarding pricing and costs for KNIME?
I rate the product’s pricing a seven out of ten, where one is cheap and ten is expensive.
What needs improvement with KNIME?
For graphics, the interface is a little confusing. So, this is a point that could be improved.
 

Also Known As

Dremio: No data available
KNIME: KNIME Analytics Platform
 

Sample Customers

Dremio: UBS, TransUnion, Quantium, Daimler, OVH
KNIME: Infocom Corporation, Dymatrix Consulting Group, Soluzione Informatiche, MMI Agency, Estanislao Training and Solutions, Vialis AG
Find out what your peers are saying about Dremio vs. KNIME and other solutions. Updated: January 2025.