

Dataiku and Dremio compete in the advanced data platforms category, offering users a variety of options for data management. While each has its strengths, Dremio's comprehensive features give it a notable edge.
Features: Dataiku shines with a strong process scheduler, easy cloud process runs, and versatile visual data preparation tools. Its support for languages like Python and R enhances data integration with platforms like BigQuery. Features for automation, team collaboration, and a user-friendly drag-and-drop interface are also praised. Dremio stands out with powerful visualization capabilities, data lineage and providence, and seamless integration with cloud storage. Its dynamic querying and virtualization are top features.
Room for Improvement: Dataiku could improve server uptime, stability in engines like Spark and Hive, and the interface's complexity for non-IT users. Enhancements in collaboration and affordability are also needed. Dremio may benefit from performance stability improvements, expanded integration capabilities, and better data connector support. Users suggest enhancing query abilities, streamlining memory management, addressing persistent bugs, and improving documentation.
Ease of Deployment and Customer Service: Dataiku provides flexible on-premises, private cloud, and hybrid deployment options. Customer service is generally responsive, though technical and billing support varies. Dremio supports diverse cloud models with swift technical support and a strong community, despite noted deployment complexities for community versions.
Pricing and ROI: Dataiku's high pricing suits larger enterprises, competing with solutions like Alteryx. The lack of a consumption-based model affects ROI perceptions. Dremio, also priced at a premium, remains competitive compared to rivals like Snowflake, with positive ROI perceptions and strong support for regulatory compliance.
The market is competitive, and Dataiku must adopt a consumption-based model instead of the current monthly model.
In terms of ROI, the use of Dataiku simplifies the architecture of customers, which helps them to decommission some of their existing tools;
Dremio surely saves time, reduces costs, and all those things because we don't have to worry so much about the infrastructure to make the different tools communicate.
Dataiku partners with local industry experts who understand the business better and provide support.
The support team does not provide adequate assistance.
As a partner with Dataiku, my experience with them is good; they are supportive, and when we contact them, we receive a quick response.
We have had to reach out for customer support many times, and they respond, so they are pretty supportive about some long-term issues.
Dremio's scalability can handle growing data and user demands easily.
Internally, if it's on Docker or Kubernetes, scalability will be built into the system.
In terms of stabilization, if my data has no outlier creation in the raw data, then it is quite stable.
I rate Dremio a nine in terms of stability.
Someone who needs to do coding can do it, and someone who does not know coding can also build solutions.
The license is very expensive.
I would love for Dataiku to allow more flexibility with code-based components and provide the possibility to extend it by developing and integrating custom components easily with existing ones.
Starburst comes with around 50 connectors now.
It should be easier to get Arctic or an open-source version of Arctic onto the software version so that development teams can experiment with it.
I see that many times the new versions of Dremio have not fixed old bugs, and in some new versions, old problems that were previously fixed come back again, so I think the upgrade part could use improvement.
There are no extra expenses beyond the existing licensing cost.
I find the pricing of Dataiku quite affordable for our customers, as they are usually large companies.
The pricing for Dataiku is very high, which is its biggest downside.
This feature is useful because it simplifies tasks and eliminates the need for a data scientist.
Dataiku primarily enhances the speed at which our customers can develop or train their machine learning models because it is a drag-and-drop platform.
It offers most of the capabilities required for data science, MLOps, and LLMOps.
Having everything under one system and an easier-to-work-with interface, along with having API integrations, adds significant value to working with Dremio.
Dremio has positively impacted my organization as nowadays we are connected to multiple databases from multiple environments, multiple APIs, and applications, and Dremio organizes everything in an amazing way for me.
You just get the source, connect the data, get visualization, get connected, and do whatever you want.
| Product | Market Share (%) |
|---|---|
| Dataiku | 9.3% |
| Dremio | 2.6% |
| Other | 88.1% |


| Company Size | Count |
|---|---|
| Small Business | 4 |
| Midsize Enterprise | 2 |
| Large Enterprise | 9 |
| Company Size | Count |
|---|---|
| Small Business | 1 |
| Midsize Enterprise | 5 |
| Large Enterprise | 5 |
Dataiku Data Science Studio is acclaimed for its versatile capabilities in advanced analytics, data preparation, machine learning, and visualization. It streamlines complex data tasks with an intuitive visual interface, supports multiple languages like Python, R, SQL, and scales efficiently for large dataset handling, boosting organizational efficiency and collaboration.
Dremio offers a comprehensive platform for data warehousing and data engineering, integrating seamlessly with data storage systems like Amazon S3 and Azure. Its main features include scalability, query federation, and data reflection.
Dremio's core strength lies in its ability to function as a robust data lake query engine and data warehousing solution. It facilitates the creation of complex queries with ease, thanks to its support for Apache Airflow and query federation across endpoints. Despite challenges with Delta connector support, complex query execution, and expensive licensing, users find it valuable for managing ad-hoc queries and financial data analytics. The platform aids in SQL table management and BI traffic visualization while reducing storage costs and resolving storage conflicts typical in traditional data warehouses.
What are Dremio's most valuable features?Dremio is primarily implemented in industries requiring extensive data engineering and analytics, including finance and technology. Companies use it for constructing data frameworks, efficiently processing financial analytics, and visualizing BI traffic. It acts as a viable alternative to AWS Glue and Apache Hive, integrating seamlessly with multiple databases, including Oracle and MySQL, offering robust solutions for data-driven strategies. Despite some challenges, its ability to reduce data storage costs and manage complex queries makes it a favorable choice among enterprise users.
We monitor all Data Science Platforms reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.