Try our new research platform with insights from 80,000+ expert users

Amazon Kinesis vs Databricks comparison

 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 17, 2024

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Amazon Kinesis
Ranking in Streaming Analytics
2nd
Average Rating
8.0
Reviews Sentiment
7.1
Number of Reviews
27
Ranking in other categories
No ranking in other categories
Databricks
Ranking in Streaming Analytics
1st
Average Rating
8.2
Reviews Sentiment
7.0
Number of Reviews
88
Ranking in other categories
Cloud Data Warehouse (7th), Data Science Platforms (1st)
 

Mindshare comparison

As of April 2025, in the Streaming Analytics category, the mindshare of Amazon Kinesis is 8.7%, down from 13.9% compared to the previous year. The mindshare of Databricks is 14.6%, up from 10.1% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Streaming Analytics
 

Featured Reviews

Rajni Kumar Jha - PeerSpot reviewer
Used for media streaming and live-streaming data
It is not compulsory to use Amazon Kinesis. If you don't want to use the data streaming, you can use just the Kinesis data firehose. Using the Kinesis data firehose is compulsory because we can't store all chats and recordings in Amazon S3 without it. When a call comes in the Amazon Kinesis instance, it will go to Data Streams if we use it. Otherwise, it will go to the Kinesis data firehose, where we need to define the S3 bucket path, and it will go to Amazon S3. So, without the Kinesis data firehose, we can't store all the chats and recordings in Amazon S3. Using Amazon Kinesis totally depends upon the user's requirements. If you want to use live streaming for the data lake or data analyst team, you need to use Amazon Kinesis. If you don't want to use it, you can directly use the Kinesis data firehose, which will be stored in Amazon S3. Overall, I rate the solution an eight out of ten.
ShubhamSharma7 - PeerSpot reviewer
Capability to integrate diverse coding languages in a single notebook greatly enhances workflow
Databricks offers various courses that I can use, whether it's PySpark, Scala, or R. I can leverage all these courses in a single notebook, which is beneficial for clients as they can access various tools in one place whenever needed. This is quite significant. I usually work with PySpark based on client requirements. After coding, I feed the Databricks notebooks into the ADF pipeline for updates. Databricks' capability to process data in parallel enhances data processing speed. Furthermore, I can connect our Databricks notebook directly with Power BI and other visualization tools like Qlik. Once we develop code, it allows us to transform raw data into visualizations for clients using analysis diagrams, which is very helpful.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The solution's technical support is flawless."
"Amazon Kinesis has improved our ROI."
"The management and analytics are valuable features."
"Great auto-scaling, auto-sharing, and auto-correction features."
"The solution works well in rather sizable environments."
"One of the best features of Amazon Kinesis is the multi-partition."
"Everything is hosted and simple."
"I have worked in companies that build tools in-house. They face scaling challenges."
"Databricks is a robust solution for big data processing, offering flexibility and powerful features."
"The setup was straightforward."
"Databricks' most valuable features are the workspace and notebooks. Its integration, interface, and documentation are also good."
"Databricks is a unified solution that we can use for streaming. It is supporting open source languages, which are cloud-agnostic. When I do database coding if any other tool has a similar language pack to Excel or SQL, I can use the same knowledge, limiting the need to learn new things. It supports a lot of Python libraries where I can use some very easily."
"I haven't heard about any major stability issues. At this time I feel like it's stable."
"Databricks' capability to process data in parallel enhances data processing speed."
"The most valuable aspect of the solution is its notebook. It's quite convenient to use, both terms of the research and the development and also the final deployment, I can just declare the spark jobs by the load tables. It's quite convenient."
"The most valuable feature of Databricks is the integration of the data warehouse and data lake, and the development of the lake house. Additionally, it integrates well with Spark for processing data in production."
 

Cons

"If there were better documentation on optimal sharding strategies then it would be helpful."
"Kinesis Data Analytics needs to be improved somewhat. It's SQL based data but it is not as user friendly as MySQL or Athena tools."
"In general, the pain point for us was that once the data gets into Kinesis there is no way for us to understand what's happening because Kinesis divides everything into shards. So if we wanted to understand what's happening with a particular shard, whether it is published or not, we could not. Even with the logs, if we want to have some kind of logging it is in the shard."
"I think the default settings are far too low."
"Amazon Kinesis involved a more complex setup and configuration than Azure Event Hub."
"One thing that would be nice would be a policy for increasing the number of Kinesis streams because that's the one thing that's constant. You can change it in real time, but somebody has to change it, or you have to set some kind of meter. So, auto-scaling of adding and removing streams would be nice."
"One area for improvement in the solution is the file size limitation of 10 Mb. My company works with files with a larger file size. The batch size and throughput also need improvement in Amazon Kinesis."
"Lacks first in, first out queuing."
"I think setting up the whole account for one person and giving access are areas that can be difficult to manage and should be made a little easier."
"I would like to see the integration between Databricks and MLflow improved. It is quite hard to train multiple models in parallel in the distributed fashions. You hit rate limits on the clients very fast."
"The solution could improve by providing better automation capabilities. For example, working together with more of a DevOps approach, such as continuous integration."
"The query plan is not easy with Databrick's job level. If I want to tune any of the code, it is not easily available in the blogs as well."
"While Databricks is generally a robust solution, I have noticed a limitation with debugging in the Delta Live Table, which could be improved."
"Anyone who doesn't know SQL may find the product difficult to work with."
"I would love an integration in my desktop IDE. For now, I have to code on their webpage."
"It would be very helpful if Databricks could integrate with platforms in addition to Azure."
 

Pricing and Cost Advice

"The tool's pricing is cheap."
"In general, cloud services are very convenient to use, even if we have to pay a bit more, as we know what we are paying for and can focus on other tasks."
"The tool's entry price is cheap. However, pricing increases with data volume."
"The fee is based on the number of hours the service is running."
"It was actually a fairly high volume we were spending. We were spending about 150 a month."
"The solution's pricing is fair."
"The pricing depends on the use cases and the level of usage. If you wanted to use Kinesis for different use cases, there's definitely a cheaper base cost involved. However, it's not entirely cheap, as different use cases might require different levels of Kinesis usage."
"Amazon Kinesis pricing is sometimes reasonable and sometimes could be better, depending on the planning, so it's a five out of ten for me."
"We implement this solution on behalf of our customers who have their own Azure subscription and they pay for Databricks themselves. The pricing is more expensive if you have large volumes of data."
"We only pay for the Azure compute behind the solution."
"The product pricing is moderate."
"Databricks is a very expensive solution. Pricing is an area that could definitely be improved. They could provide a lower end compute and probably reduce the price."
"The price of Databricks is reasonable compared to other solutions."
"Whenever we want to find the actual costing, we have to send an email to Databricks, so having the information available on the internet would be helpful."
"We find Databricks to be very expensive, although this improved when we found out how to shut it down at night."
"My smallest project is around a hundred euros, and my most expensive is just under a thousand euros a week. That is based on terabytes of data processed each month."
report
Use our free recommendation engine to learn which Streaming Analytics solutions are best for your needs.
848,716 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Computer Software Company
18%
Financial Services Firm
17%
Manufacturing Company
10%
Retailer
5%
Financial Services Firm
17%
Computer Software Company
10%
Manufacturing Company
9%
Healthcare Company
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Amazon Kinesis?
Amazon Kinesis's main purpose is to provide near real-time data streaming at a consistent 2Mbps rate, which is really impressive.
What is your experience regarding pricing and costs for Amazon Kinesis?
Amazon Kinesis is moderately priced. In comparison with other competitors, it is fairly priced, however, if they reduced the price a little, it could add more value to customers.
What needs improvement with Amazon Kinesis?
I do not see any scope for improvement as it does what it is supposed to do. No changes are required. Since it's predominantly a back-end service, any end-user isn't going to interact with it direc...
Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or Python. It offers many different cluster choices and excellent integration with ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designed to accelerate innovation projects. It is based on Spark so it is very fast. It...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analytics teams that have to interpret data to further the business goals of their orga...
 

Comparisons

 

Also Known As

Amazon AWS Kinesis, AWS Kinesis, Kinesis
Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
 

Overview

 

Sample Customers

Zillow, Netflix, Sonos
Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Find out what your peers are saying about Amazon Kinesis vs. Databricks and other solutions. Updated: April 2025.
848,716 professionals have used our research since 2012.