Try our new research platform with insights from 80,000+ expert users

Databricks vs Starburst Enterprise comparison

Sponsored
 

Comparison Buyer's Guide

Executive SummaryUpdated on Dec 5, 2024
 

Categories and Ranking

IBM SPSS Statistics
Sponsored
Ranking in Data Science Platforms
9th
Average Rating
8.0
Reviews Sentiment
6.9
Number of Reviews
37
Ranking in other categories
Data Mining (3rd)
Databricks
Ranking in Data Science Platforms
1st
Average Rating
8.2
Reviews Sentiment
7.0
Number of Reviews
84
Ranking in other categories
Streaming Analytics (1st)
Starburst Enterprise
Ranking in Data Science Platforms
14th
Average Rating
8.6
Reviews Sentiment
6.9
Number of Reviews
2
Ranking in other categories
Streaming Analytics (12th)
 

Mindshare comparison

As of December 2024, in the Data Science Platforms category, the mindshare of IBM SPSS Statistics is 2.7%, up from 2.7% compared to the previous year. The mindshare of Databricks is 19.2%, up from 18.7% compared to the previous year. The mindshare of Starburst Enterprise is 2.1%, up from 1.6% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Data Science Platforms
 

Featured Reviews

Md Masudul Hassan - PeerSpot reviewer
Comprehensive data analysis capabilities with a user-friendly interface, providing an efficient and reliable platform for researchers and analysts
I believe that offering short-term SPSS licenses, perhaps when customer sourcing is available, could make it more affordable. These licenses shouldn't include features tailored for universities or large sales organizations. Instead, they could offer discounts or additional facilities for smaller entities to access the software. In developing countries, it would be beneficial to provide certain features to users at no cost initially, while also customizing pricing options. For example, offering basic features to the first hundred users can help them become familiar with the software and its capabilities. This approach encourages users to upgrade to higher tiers as they become more experienced and require additional functionality.
Dunstan Matekenya - PeerSpot reviewer
Process large-scale data sets and integrates with Apache Spark with notebook environment
Databricks integrates natively with Apache Spark, which I use as a processing engine for large-scale datasets. This native integration is one of its strengths. Another strength is that the platform makes it very easy to manage resources. For example, setting up a cluster of five or fifteen nodes is straightforward with Databricks. The notebook environment is also excellent, making it easy to perform various tasks.
KamleshPant - PeerSpot reviewer
Connects to any data source from any region and offers unified access
There are no specific projects supported by Starburst regarding AI initiatives or machine learning projects. In the future, if we have all the data available, we can definitely capitalize on AI/ML and LLM capabilities to summarize data and gain insights. That's our future goal, but we haven't reached that point yet. There should be support for REST API data sources to access data from the web. We often have data coming in and communicate with data sources via REST API calls. I don't see that capability in Starburst currently; everything is through JDBC or ODBC. If Starburst could seamlessly access data using REST API capabilities, it would be a game-changer. The self-service data management features, like self-service materialized views, are great, but they can be a bit complex for basic users to understand.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"The most valuable feature is its robust statistical analysis capabilities."
"In terms of the features I've found most valuable, I'd say the duration, the correlation, and of course the nonparametric statistics. I use it for reliability and survival analysis, time series, regression models in different solutions, and different types of solutions."
"It has the ability to easily change any variable in our research."
"The most valuable feature is the user interface because you don't need to write code."
"I've found the descriptive statistics and cross-tabs valuable. The very simple correlations and regressions are as well."
"It is perfectly adequate if all you need are the results and not the trail of evidence."
"The best part is that they have an algorithm handbook, so you can open it up and understand how it works, and if it is useful, this is very important."
"It offers very good visualization."
"In the manufacturing industry, Databricks can be beneficial to use because of machine learning. It is useful for tasks, such as product analysis or predictive maintenance."
"The time travel feature is the solution's most valuable aspect."
"Databricks has a scalable Spark cluster creation process. The creators of Databricks are also the creators of Spark, and they are the industry leaders in terms of performance."
"It is fast, it's scalable, and it does the job it needs to do."
"The most valuable features of the solution are the hardware and the resources it quickly provides without much hassle."
"The most valuable feature of Databricks is the integration of the data warehouse and data lake, and the development of the lake house. Additionally, it integrates well with Spark for processing data in production."
"Databricks is hosted on the cloud. It is very easy to collaborate with other team members who are working on it. It is production-ready code, and scheduling the jobs is easy."
"The setup is quite easy."
"We have noticed improvements in performance using Starburst Enterprise. It handles complex data, including reading and partitioning files. We can add a new catalog to Starburst Enterprise by providing connection details and service account information. This allows us to integrate with existing tools, such as the Snowflake database, which we use for data protection in our project."
"It's very scalable, fast performing, and supports many catalogs."
 

Cons

"Needs more statistical modelling functions."
"One of the areas that should be similar to Minitabs is the use of blogs. The Minitabs blog helps users understand the tools and gives lots of practical examples. Following the SPSS manual is cumbersome. It's a good, exhaustive manual, but it's not practical to use. With Minitabs, you can go to the blogs and find specific articles written about various components and it's very helpful. Without blogs, we find SPSS more complicated."
"The product should provide more ways to import data and export results that are user-friendly for high-level executives."
"Technical support needs some improvement, as they do not respond as quickly as we would like."
"It could allow adding color to data models to make them easier to interpret."
"The solution could improve by providing a visual network for predictions and a self-organizing map for clustering."
"This solution is not suitable for use with Big Data."
"SPSS slows down the computer or the laptop if the data is huge; then you need a faster computer."
"The query plan is not easy with Databrick's job level. If I want to tune any of the code, it is not easily available in the blogs as well."
"I'm not the guy that I'm working with Databricks on a daily basis. I'm on the management team. However, my team tells me there are limitations with streaming events. The connectors work with a small set of platforms. For example, we can work with Kafka, but if we want to move to an event-driven solution from AWS, we cannot do it. We cannot connect to all the streaming analytics platforms, so we are limited in choosing the best one."
"This solution only supports queries in SQL and Python, which is a bit limiting."
"The solution has some scalability and integration limitations when consolidating legacy systems."
"Databricks would have more collaborative features than it has. It should have some more customization for the jobs."
"Databricks has added some alerts and query functionality into their SQL persona, but the whole SQL persona, which is like a role, needs a lot of development. The alerts are not very flexible, and the query interface itself is not as polished as the notebook interface that is used through the data science and machine learning persona. It is clunky at present."
"I would love an integration in my desktop IDE. For now, I have to code on their webpage."
"The product cannot be integrated with a popular coding IDE."
"There should be support for REST API data sources to access data from the web."
"Starburst Enterprise could improve by offering additional features similar to those provided by other SQL query tools. For example, incorporating functionalities like pivot tables would make it more feasible to use."
 

Pricing and Cost Advice

"While the pricing of the product may be higher, the accompanying service and features justify the investment."
"Our licence is on a yearly renewal basis. While pricing is not the primary concern in our evaluation, as products are assessed by whether they can meet our user needs and expertise, the cost can be a limiting factor in the number of licences we procure."
"If it requires lot of data processing, maybe switching to IBM SPSS Clementine would be better for the buyer."
"The price of IBM SPSS Statistics could improve."
"The price of this solution is a little bit high, which was a problem for my company."
"We think that IBM SPSS is expensive for this function."
"More affordable training for new staff members."
"It's quite expensive, but they do a special deal for universities."
"Databricks are not costly when compared with other solutions' prices."
"Databricks is a very expensive solution. Pricing is an area that could definitely be improved. They could provide a lower end compute and probably reduce the price."
"We implement this solution on behalf of our customers who have their own Azure subscription and they pay for Databricks themselves. The pricing is more expensive if you have large volumes of data."
"I rate the price of Databricks as eight out of ten."
"The basic version of this solution is now open-source, so there are no license costs involved. However, there is a charge for any advanced functionality and this can be quite expensive."
"Databricks' cost could be improved."
"I do not exactly know the costs, but one of our clients pays between $100 USD and $200 USD monthly."
"I would rate the tool’s pricing an eight out of ten."
"I haven't personally dealt with the pricing aspects first-hand, but from what I understand, it largely depends on the specifics of your setup, especially the machines you use on AWS. The cost of using Starburst Enterprise can vary based on the amount of data you're processing and the type of machines you opt for, whether on AWS or another cloud platform."
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
824,053 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
Computer Software Company
9%
University
8%
Manufacturing Company
8%
Financial Services Firm
16%
Computer Software Company
11%
Manufacturing Company
9%
Healthcare Company
6%
Financial Services Firm
45%
Computer Software Company
10%
Government
5%
Energy/Utilities Company
5%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about IBM SPSS Statistics?
The software offers consistency across multiple research projects helping us with predictive analytics capabilities.
What is your experience regarding pricing and costs for IBM SPSS Statistics?
The cost of IBM SPSS Statistics is managed by organizations, not individual researchers. It is a very expensive produ...
What needs improvement with IBM SPSS Statistics?
IBM SPSS Statistics does not keep you close to your data like KNIME. In KNIME, at every stage, you can see the result...
Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designe...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analyti...
What is your experience regarding pricing and costs for Starburst Enterprise?
I haven't personally dealt with the pricing aspects first-hand, but from what I understand, it largely depends on the...
What needs improvement with Starburst Enterprise?
There are no specific projects supported by Starburst regarding AI initiatives or machine learning projects. In the f...
What is your primary use case for Starburst Enterprise?
We use Starburst with one client who is exploring their ecosystem to remove data silos and enable data access across ...
 

Also Known As

SPSS Statistics
Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
No data available
 

Learn More

Video not available
 

Overview

 

Sample Customers

LDB Group, RightShip, Tennessee Highway Patrol, Capgemini Consulting, TEAC Corporation, Ironside, nViso SA, Razorsight, Si.mobil, University Hospitals of Leicester, CROOZ Inc., GFS Fundraising Solutions, Nedbank Ltd., IDS-TILDA
Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Information Not Available
Find out what your peers are saying about Databricks vs. Starburst Enterprise and other solutions. Updated: December 2024.
824,053 professionals have used our research since 2012.