We performed a comparison between Amazon Kinesis and Databricks based on real PeerSpot user reviews.
Find out in this report how the two Streaming Analytics solutions compare in terms of features, pricing, service and support, easy of deployment, and ROI."Great auto-scaling, auto-sharing, and auto-correction features."
"The solution works well in rather sizable environments."
"The solution's technical support is flawless."
"What I like about Amazon Kinesis is that it's very effective for small businesses. It's a well-managed solution with excellent reporting. Amazon Kinesis is also easy to use, and even a novice developer can work with it, versus Apache Kafka, which requires expertise."
"From my experience, one of the most valuable features is the ability to track silent events on endpoints. Previously, these events might have gone unnoticed, but now we can access them within the product range. For example, if a customer reports that their calls are not reaching the portal files, we can use this feature to troubleshoot and optimize the system."
"The management and analytics are valuable features."
"I find almost all features valuable, especially the timing and fast pace movement."
"Amazon Kinesis also provides us with plenty of flexibility."
"Databricks gives us the ability to build a lakehouse framework and do everything implicit to this type of database structure. We also like the ability to stream events. Databricks covers a broad spectrum, from reporting and machine learning to streaming events. It's important for us to have all these features in one platform."
"We are completely satisfied with the ease of connecting to different sources of data or pocket files in the search"
"Databricks makes it really easy to use a number of technologies to do data analysis. In terms of languages, we can use Scala, Python, and SQL. Databricks enables you to run very large queries, at a massive scale, within really good timeframes."
"The Delta Lake data type has been the most useful part of this solution. Delta Lake is an opensource data type and it was implemented and invented by Databricks."
"The setup was straightforward."
"I work in the data science field and I found Databricks to be very useful."
"A very valuable feature is the data processing, and the solution is specifically good at using the Spark ecosystem."
"The main features of the solution are efficiency."
"One thing that would be nice would be a policy for increasing the number of Kinesis streams because that's the one thing that's constant. You can change it in real time, but somebody has to change it, or you have to set some kind of meter. So, auto-scaling of adding and removing streams would be nice."
"AI processing or cleaning up data would be nice since I don't think it is a feature in Amazon Kinesis right now."
"For me, especially with video streams, there's sometimes a kind of delay when the data has to be pumped to other services. This delay could be improved in Kinesis, or especially the Kinesis Video Streams, which is being used for different use cases for Amazon Connect. With that improvement, a lot of other use cases of Amazon Connect integrating with third-party analytic tools would be easier."
"The services which are described in the documentation could use some visual presentation because for someone who is new to the solution the documentation is not easy to follow or beginner friendly and can leave a person feeling helpless."
"I think the default settings are far too low."
"Kinesis Data Analytics needs to be improved somewhat. It's SQL based data but it is not as user friendly as MySQL or Athena tools."
"The solution has a two-minute maximum time delay for live streaming, which could be reduced."
"Something else to mention is that we use Kinesis with Lambda a lot and the fact that you can only connect one Stream to one Lambda, I find is a limiting factor. I would definitely recommend to remove that constraint."
"The solution could be improved by adding a feature that would make it more user-friendly for our team. The feature is simple, but it would be useful. Currently, our team is more familiar with the language R, but Databricks requires the use of Jupyter Notebooks which primarily supports Python. We have tried using RStudio, but it is not a fully integrated solution. To fully utilize Databricks, we have to use the Jupyter interface. One feature that would make it easier for our team to adopt the Jupyter interface would be the ability to select a specific variable or line of code and execute it within a cell. This feature is available in other Jupyter Notebooks outside of Databricks and in our own IDE, but it is not currently available within Databricks. If this feature were added, it would make the transition to using Databricks much smoother for our team."
"The integration features could be more interesting, more involved."
"In the future, I would like to see Data Lake support. That is something that I'm looking forward to."
"It's not easy to use, and they need a better UI."
"It would be better if it were faster. It can be slow, and it can be super fast for big data. But for small data, sometimes there is a sub-second response, which can be considered slow. In the next release, I would like to have automatic creation of APIs because they don't have it at the moment, and I spend a lot of time building them."
"Databricks is not geared towards the end-user, but rather it is for data engineers or data scientists."
"The tool should improve its integration with other products."
"The interface of Databricks could be easier to use when compared to other solutions. It is not easy for non-data scientists. The user interface is important before we had to write code manually and as solutions move to "No code AI" it is critical that the interface is very good."
Amazon Kinesis is ranked 1st in Streaming Analytics with 24 reviews while Databricks is ranked 2nd in Streaming Analytics with 78 reviews. Amazon Kinesis is rated 8.0, while Databricks is rated 8.2. The top reviewer of Amazon Kinesis writes "Used for media streaming and live-streaming data". On the other hand, the top reviewer of Databricks writes "A nice interface with good features for turning off clusters to save on computing". Amazon Kinesis is most compared with Azure Stream Analytics, Amazon MSK, Confluent, Apache Flink and Spring Cloud Data Flow, whereas Databricks is most compared with Amazon SageMaker, Informatica PowerCenter, Dataiku, Dremio and Microsoft Azure Machine Learning Studio. See our Amazon Kinesis vs. Databricks report.
See our list of best Streaming Analytics vendors.
We monitor all Streaming Analytics reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. We validate each review for authenticity via cross-reference with LinkedIn, and personal follow-up with the reviewer when necessary.