Try our new research platform with insights from 80,000+ expert users

Confluent vs Databricks comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 17, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Confluent
Ranking in Streaming Analytics
3rd
Average Rating
8.2
Reviews Sentiment
6.8
Number of Reviews
22
Ranking in other categories
No ranking in other categories
Databricks
Ranking in Streaming Analytics
1st
Average Rating
8.2
Reviews Sentiment
7.0
Number of Reviews
85
Ranking in other categories
Data Science Platforms (1st)
 

Mindshare comparison

As of January 2025, in the Streaming Analytics category, the mindshare of Confluent is 8.8%, down from 11.9% compared to the previous year. The mindshare of Databricks is 14.6%, up from 10.1% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Streaming Analytics
 

Featured Reviews

Yantao Zhao - PeerSpot reviewer
Great tool for sharing knowledge, internal communication and allows for real-time collaboration on pages
Confluence is easy to use and modify. However, sometimes there are too many pages. We have to reorganize the folder or parent account. Since everyone can create a page, the same knowledge might be created in multiple places by different people. This leads to redundancy and makes it difficult to find information. It's not centralized. So it could be more user-friendly and centralized. A way to reduce redundancy would be helpful. It's very easy to use, so everyone can create knowledge. But it would be good to synchronize and organize that information a bit better. Another improvement would be in Confluence search. You can search for keywords, but it's not like AI, not even ChatGPT or OpenAI. It would be nice to get more relevant or organized answers. If you're outside the company, you just get some titles containing the keyword you input. But if Confluence were like a database, you could input something and get a well-organized search offering from multiple pages.
Parag Bhosale - PeerSpot reviewer
Integrating engineering and learning, but cost challenges arise with cluster management
We often use a single cluster to ingest Databricks, which Databricks doesn't recommend. They suggest using a no-cluster solution like job clusters. This can be overwhelming for us because we started smaller. We prefer using a small to mid-sized cluster for many jobs to keep costs low, but this sometimes doesn't support our operations properly. We need to stay in sync with the DVR versions, and migrations can pose challenges. For example, issues arose when we moved a cluster from a previous version to the latest one. We could use their job clusters, however, that increases costs, which is challenging for us as a startup. Maintaining this infrastructure can be a headache.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"A person with a good IT background and HTML will not have any trouble with Confluent."
"One of the best features of Confluent is that it's very easy to search and have a live status with Jira."
"The client APIs are the most valuable feature."
"With Confluent Cloud we no longer need to handle the infrastructure and the plumbing, which is a concern for Confluent. The other advantage is that all portfolios have access to the data that is being shared."
"I find Confluent's Kafka Connectors and Kafka Streams invaluable for my use cases because they simplify real-time data processing and ETL tasks by providing reliable, pre-packaged connectors and tools."
"Their tech support is amazing; they are very good, both on and off-site."
"The solution can handle a high volume of data because it works and scales well."
"The most valuable is its capability to enhance the documentation process, particularly when creating software documentation."
"We like that this solution can handle a wide variety and velocity of data engineering, either in batch mode or real-time."
"The capacity of use of the different types of coding is valuable. Databricks also has good performance because it is running in spark extra storage, meaning the performance and the capacity use different kinds of codes."
"I like cloud scalability and data access for any type of user."
"We have the ability to scale, collaborate and do machine learning."
"Databricks provides a consistent interface for data engineers to work with data in a consistent language on a single integrated platform for ingesting, processing, and serving data to the end user."
"The built-in optimization recommendations halved the speed of queries and allowed us to reach decision points and deliver insights very quickly."
"A very valuable feature is the data processing, and the solution is specifically good at using the Spark ecosystem."
"What I like about Databricks is that it's one of the most popular platforms that give access to folks who are trying not just to do exploratory work on the data but also go ahead and build advanced modeling and machine learning on top of that."
 

Cons

"Areas for improvement include implementing multi-storage support to differentiate between database stores based on data age and optimizing storage costs."
"It could have more integration with different platforms."
"There is a limitation when it comes to seamlessly importing Microsoft documents into Confluent pages, which can be inconvenient for users who frequently work with Microsoft Office tools and need to transition their content to Confluent."
"It could have more themes. They should also have more reporting-oriented plugins as well. It would be great to have free custom reports that can be dispatched directly from Jira."
"The formatting aspect within the page can be improved and more powerful."
"Confluent's price needs improvement."
"The Schema Registry service could be improved. I would like a bigger knowledge base of other use cases and more technical forums. It would be good to have more flexible monitoring features added to the next release as well."
"It could be improved by including a feature that automatically creates a new topic and puts failed messages."
"I would love an integration in my desktop IDE. For now, I have to code on their webpage."
"There is room for improvement in the documentation of processes and how it works."
"The data visualization for this solution could be improved. They have started to roll out a data visualization tool inside Databricks but it is in the early stages. It's not comparable to a solution like Power BI, Luca, or Tableau."
"This solution only supports queries in SQL and Python, which is a bit limiting."
"Instead of relying on a massive instance, the solution should offer micro partition levels. They're working on it, however, they need to implement it to help the solution run more effectively."
"It would be nice to have more guidance on integrations with ETLs and other data quality tools."
"I would like to see the integration between Databricks and MLflow improved. It is quite hard to train multiple models in parallel in the distributed fashions. You hit rate limits on the clients very fast."
"Databricks is not geared towards the end-user, but rather it is for data engineers or data scientists."
 

Pricing and Cost Advice

"The solution is cheaper than other products."
"On a scale from one to ten, where one is low pricing and ten is high pricing, I would rate Confluent's pricing at five. I have not encountered any additional costs."
"You have to pay additional for one or two features."
"Confluent has a yearly license, which is a bit high because it's on a per-user basis."
"Confluent is an expensive solution."
"Regarding pricing, I think Confluent is a premium product, but it's hard for me to say definitively if it's overly expensive. We're still trying to understand if the features and reduced maintenance complexity justify the cost, especially as we scale our platform use."
"Confluence's pricing is quite reasonable, with a cost of around $10 per user that decreases as the number of users increases. Additionally, it's worth noting that for teams of up to 10 users, the solution is completely free."
"Confluent is expensive, I would prefer, Apache Kafka over Confluent because of the high cost of maintenance."
"We only pay for the Azure compute behind the solution."
"The solution uses a pay-per-use model with an annual subscription fee or package. Typically this solution is used on a cloud platform, such as Azure or AWS, but more people are choosing Azure because the price is more reasonable."
"The solution is affordable."
"My smallest project is around a hundred euros, and my most expensive is just under a thousand euros a week. That is based on terabytes of data processed each month."
"The billing of Databricks can be difficult and should improve."
"The cost is around $600,000 for 50 users."
"The product pricing is moderate."
"The solution is based on a licensing model."
report
Use our free recommendation engine to learn which Streaming Analytics solutions are best for your needs.
831,158 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
19%
Computer Software Company
17%
Manufacturing Company
8%
Insurance Company
5%
Financial Services Firm
17%
Computer Software Company
11%
Manufacturing Company
9%
Healthcare Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Confluent?
I find Confluent's Kafka Connectors and Kafka Streams invaluable for my use cases because they simplify real-time data processing and ETL tasks by providing reliable, pre-packaged connectors and to...
What is your experience regarding pricing and costs for Confluent?
Regarding pricing, I think Confluent is a premium product, but it's hard for me to say definitively if it's overly expensive. We're still trying to understand if the features and reduced maintenanc...
What needs improvement with Confluent?
One area we've identified that could be improved is the governance and access control to the Kafka topics. We've found some limitations, like a threshold of 10,000 rules per cluster, that make it c...
Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or Python. It offers many different cluster choices and excellent integration with ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designed to accelerate innovation projects. It is based on Spark so it is very fast. It...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics teams that have to interpret data to further the business goals of their orga...
 

Comparisons

 

Also Known As

No data available
Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
 

Overview

 

Sample Customers

ING, Priceline.com, Nordea, Target, RBC, Tivo, Capital One, Chartboost
Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Find out what your peers are saying about Confluent vs. Databricks and other solutions. Updated: January 2025.
831,158 professionals have used our research since 2012.