Try our new research platform with insights from 80,000+ expert users

Databricks vs Dataiku comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Jan 12, 2025

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Databricks
Ranking in Data Science Platforms
1st
Average Rating
8.2
Reviews Sentiment
7.0
Number of Reviews
88
Ranking in other categories
Cloud Data Warehouse (7th), Streaming Analytics (1st)
Dataiku
Ranking in Data Science Platforms
7th
Average Rating
8.2
Reviews Sentiment
7.1
Number of Reviews
11
Ranking in other categories
No ranking in other categories
 

Mindshare comparison

As of March 2025, in the Data Science Platforms category, the mindshare of Databricks is 18.5%, up from 18.7% compared to the previous year. The mindshare of Dataiku is 12.5%, up from 8.0% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Science Platforms
 

Featured Reviews

ShubhamSharma7 - PeerSpot reviewer
Capability to integrate diverse coding languages in a single notebook greatly enhances workflow
Databricks offers various courses that I can use, whether it's PySpark, Scala, or R. I can leverage all these courses in a single notebook, which is beneficial for clients as they can access various tools in one place whenever needed. This is quite significant. I usually work with PySpark based on client requirements. After coding, I feed the Databricks notebooks into the ADF pipeline for updates. Databricks' capability to process data in parallel enhances data processing speed. Furthermore, I can connect our Databricks notebook directly with Power BI and other visualization tools like Qlik. Once we develop code, it allows us to transform raw data into visualizations for clients using analysis diagrams, which is very helpful.
RichardXu - PeerSpot reviewer
The platform organizes workflows visually and efficiently
One of the valuable features of Dataiku is the workflow capability. It allows us to organize a workflow efficiently. The platform has a visual interface, making it much easier for educated professionals to organize their work. This feature is useful because it simplifies tasks and eliminates the need for a data scientist. If you are knowledgeable about AI, you can directly write using primitive tools like Pantera flow, PyTorch, and Scikit-learn. However, Dataiku makes this process much easier.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Imageflow is a visual tool that helps make it easier for business people to understand complex workflows."
"Databricks makes it really easy to use a number of technologies to do data analysis. In terms of languages, we can use Scala, Python, and SQL. Databricks enables you to run very large queries, at a massive scale, within really good timeframes."
"The setup was straightforward."
"Databricks is a robust solution for big data processing, offering flexibility and powerful features."
"Databricks has a scalable Spark cluster creation process. The creators of Databricks are also the creators of Spark, and they are the industry leaders in terms of performance."
"The initial setup phase of Databricks was good."
"We have the ability to scale, collaborate and do machine learning."
"The fast data loading process and data storage capabilities are great."
"The most valuable feature of this solution is that it is one tool that can do everything, and you have the ability to very easily push your design to prediction."
"The solution is quite stable."
"One of the valuable features of Dataiku is the workflow capability."
"Dataiku is highly regarded as it is a leader in the Gartner ranking."
"Traceability is vital since I manage many cohorts, and collaboration is key as I have multiple engineers substituting for one another."
"Cloud-based process run helps in not keeping the systems on while processes are running."
"Extremely easy to use with its GUI-based functionality and large compatibility with various data sources. Also, maintenance processes are much more automated than ever, with fewer errors."
"If many teams are collaborating and sharing Jupyter notebooks, it's very useful."
 

Cons

"Databricks may not be as easy to use as other tools, but if you simplify a tool too much, it won't have the flexibility to go in-depth. Databricks is completely in the programmer's hands. I prefer flexibility rather than simplicity."
"There has been a significant evolution in databases. One area of improvement is the Databricks File System (DBFS), where command-line challenges arise when accessing files."
"Anyone who doesn't know SQL may find the product difficult to work with."
"If I want to create a Databricks account, I need to have a prior cloud account such as an AWS account or an Azure account. Only then can I create a Databricks account on the cloud. However, if they can make it so that I can still try Databricks even if I don't have a cloud account on AWS and Azure, it would be great. That is, it would be nice if it were possible to create a pseudo account and be provided with a free trial. It is very essential to creating a workforce on Databricks. For example, students or corporate staff can then explore and learn Databricks."
"The ability to customize our own pipelines would enhance the product, similar to what's possible using ML files in Microsoft Azure DevOps."
"Databricks is not geared towards the end-user, but rather it is for data engineers or data scientists."
"The solution has some scalability and integration limitations when consolidating legacy systems."
"I'm not the guy that I'm working with Databricks on a daily basis. I'm on the management team. However, my team tells me there are limitations with streaming events. The connectors work with a small set of platforms. For example, we can work with Kafka, but if we want to move to an event-driven solution from AWS, we cannot do it. We cannot connect to all the streaming analytics platforms, so we are limited in choosing the best one."
"Server up-time needs to be improved. Also, query engines like Spark and Hive need to be more stable."
"Although known for Big Data, the processing time to process 1.8 billion records was terribly slow (five days)."
"The license is very expensive. It would be great to have an intermediate license for basic treatments that do not require extensive experience."
"The technical support from Dataiku is not good. The support team does not provide adequate assistance, and there are concerns about billing requests."
"I find that it is a little slow during use. It takes more time than I would expect for operations to complete."
"The license is very expensive."
"The interface for the web app can be a bit difficult. It needs to have better capabilities, at least for developers who like to code. This is due to the fact that everything is enabled in a single window with different tabs. For them to actually develop and do the concurrent testing that needs to be done, it takes a bit of time. That is one improvement that I would like to see - from a web app developer perspective."
"I think it would help if Data Science Studio added some more features and improved the data model."
 

Pricing and Cost Advice

"Licensing on site I would counsel against, as on-site hardware issues tend to really delay and slow down delivery."
"The price of Databricks is reasonable compared to other solutions."
"The solution requires a subscription."
"The solution is a good value for batch processing and huge workloads."
"I am based in South Africa, where it is expensive adapting to the cloud, and then there is the price for the tool itself."
"Whenever we want to find the actual costing, we have to send an email to Databricks, so having the information available on the internet would be helpful."
"The cost for Databricks depends on the use case. I work on it as a consultant, so I'm using the client's Databricks, so it depends on how big the client is."
"My smallest project is around a hundred euros, and my most expensive is just under a thousand euros a week. That is based on terabytes of data processed each month."
"Pricing is pretty steep. Dataiku is also not that cheap."
"The annual licensing fees are approximately €20 ($22 USD) per key for the basic version and €40 ($44 USD) per key for the version with everything."
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
842,161 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
Computer Software Company
11%
Manufacturing Company
9%
Healthcare Company
6%
Financial Services Firm
18%
Educational Organization
14%
Manufacturing Company
9%
Computer Software Company
8%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or Python. It offers many different cluster choices and excellent integration with ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designed to accelerate innovation projects. It is based on Spark so it is very fast. It...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics teams that have to interpret data to further the business goals of their orga...
What needs improvement with Dataiku Data Science Studio?
I need more experience in the sector, which is health. The license is very expensive. It would be great to have an intermediate license for basic treatments that do not require extensive experience.
What is your primary use case for Dataiku Data Science Studio?
I use that IQ since I am preparing cohorts for health investment research.
 

Comparisons

 

Also Known As

Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
Dataiku DSS
 

Overview

 

Sample Customers

Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
BGL BNP Paribas, Dentsu Aegis, Link Mobility Group, AramisAuto
Find out what your peers are saying about Databricks vs. Dataiku and other solutions. Updated: March 2025.
842,161 professionals have used our research since 2012.