Try our new research platform with insights from 80,000+ expert users

Databricks vs VAST Data comparison

Sponsored
 

Comparison Buyer's Guide

Executive SummaryUpdated on Oct 8, 2024
 

Categories and Ranking

IBM SPSS Statistics
Sponsored
Average Rating
8.0
Number of Reviews
37
Ranking in other categories
Data Mining (3rd), Data Science Platforms (10th)
Databricks
Average Rating
8.2
Number of Reviews
82
Ranking in other categories
Data Science Platforms (1st), Streaming Analytics (1st)
VAST Data
Average Rating
10.0
Number of Reviews
2
Ranking in other categories
All-Flash Storage (21st), File and Object Storage (8th), NVMe All-Flash Storage Arrays (8th)
 

Mindshare comparison

Data Science Platforms
NVMe All-Flash Storage Arrays
 

Featured Reviews

AbakarAhmat - PeerSpot reviewer
Sep 21, 2023
Enhancing survey analysis that provides valued insightfulness
I use it to analyze questionnaire surveys related to a product, solution, or application, such as open data services, which I provide to consumers and end-users. These surveys contain evaluation assessments, and I use SPSS to analyze the responses The most valuable feature is its robust…
Dunstan Matekenya - PeerSpot reviewer
Jul 10, 2024
Process large-scale data sets and integrates with Apache Spark with notebook environment
I primarily use Databricks to process large-scale data sets with Apache Spark. My main use case is processing large data sets, such as 600 GB or 800 GB Databricks integrates natively with Apache Spark, which I use as a processing engine for large-scale datasets. This native integration is one of…
Alan Powers - PeerSpot reviewer
May 3, 2023
Stability-wise, a device that has been up and running for years
The solution is useful for machine learning and scientific applications, including computer simulations The failover capability and resiliency are some of the solution's valuable features. The big thing is resilience because it has richer coding in it, so multiple devices can't fail. Also, one…

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"I've found the descriptive statistics and cross-tabs valuable. The very simple correlations and regressions are as well."
"It is perfectly adequate if all you need are the results and not the trail of evidence."
"You can find a complete algorithm in the solution and use it. You don't need to write your own algorithms for predictive analytics. That's the most valuable feature and the main one we use."
"SPSS is quite robust and quicker in terms of providing you the output."
"One feature I found very valuable was the analysis of variance (ANOVA)."
"In terms of the features I've found most valuable, I'd say the duration, the correlation, and of course the nonparametric statistics. I use it for reliability and survival analysis, time series, regression models in different solutions, and different types of solutions."
"It has helped our analyst unit deliver work with more transparency and confidence, given that we can always view the dataset in totality, after each step of data transformation."
"The most valuable feature is its robust statistical analysis capabilities."
"Databricks covers end-to-end data analytics workflow in one platform, this is the best feature of the solution."
"The ease of use and its accessibility are valuable."
"Automation with Databricks is very easy when using the API."
"The solution's features are fantastic and include interactive clusters that perform at top speed when compared to other solutions."
"We are completely satisfied with the ease of connecting to different sources of data or pocket files in the search"
"Databricks gives you the flexibility of using several programming languages independently or in combination to build models."
"In the manufacturing industry, Databricks can be beneficial to use because of machine learning. It is useful for tasks, such as product analysis or predictive maintenance."
"The integration with Python and the notebooks really helps."
"This has been one of the most reliable storage systems that I have ever used."
"The solution is useful for machine learning and scientific applications, including computer simulations."
 

Cons

"Technical support needs some improvement, as they do not respond as quickly as we would like."
"I would like SPSS to improve its integration with other data-filing IBM tools. I also think its duration with data, utilization, and graphics could be better."
"I know that SPSS is a statistical tool but it should also include a little bit of analytical behavior. You can call it augmented analysis or predictive analysis. The bottom line is it should have more graphical and analytical capabilities."
"IBM SPSS Statistics could improve the visual outputs where you are producing, for example, a graph for a company board of directors, or an advert."
"Perhaps in terms of visualization. It's not really easy to do some data visualization, just simple, descriptive analysis in SPSS. I think that could be an area for improvement."
"In some cases, the product takes time to load a large dataset. They could improve this particular area."
"It would be helpful if there was better documentation on how to properly use the solution. A beginner's guide on how to use the various programming functions within the product would be so useful to a lot of people. I found that everything was very confusing at first. Having clear documentation would help alleviate that."
"This solution is not suitable for use with Big Data."
"I would like more integration with SQL for using data in different workspaces."
"Databricks has added some alerts and query functionality into their SQL persona, but the whole SQL persona, which is like a role, needs a lot of development. The alerts are not very flexible, and the query interface itself is not as polished as the notebook interface that is used through the data science and machine learning persona. It is clunky at present."
"We'd like a more visual dashboard for analysis It needs better UI."
"I believe that this product could be improved by becoming more user-friendly."
"I have seen better user interfaces, so that is something that can be improved."
"Databricks would benefit from enhanced metrics and tighter integration with Azure's diagnostics."
"The product needs samples and templates to help invite users to see results and understand what the product can do."
"Doesn't provide a lot of credits or trial options."
"The write performance could be improved because it is less than half of the read performance."
"The read/write ratio is an area in the solution with some flaws and needs improvement."
 

Pricing and Cost Advice

"The pricing of the modeler is high and can reduce the utility of the product for those who can not afford to adopt it."
"Our licence is on a yearly renewal basis. While pricing is not the primary concern in our evaluation, as products are assessed by whether they can meet our user needs and expertise, the cost can be a limiting factor in the number of licences we procure."
"SPSS is an expensive piece of software because it's incredibly complex and has been refined over decades, but I would say it's fairly priced."
"I rate the tool's pricing a five out of ten."
"More affordable training for new staff members."
"It's quite expensive, but they do a special deal for universities."
"We think that IBM SPSS is expensive for this function."
"The price of IBM SPSS Statistics could improve."
"The solution uses a pay-per-use model with an annual subscription fee or package. Typically this solution is used on a cloud platform, such as Azure or AWS, but more people are choosing Azure because the price is more reasonable."
"The pricing depends on the usage itself."
"My smallest project is around a hundred euros, and my most expensive is just under a thousand euros a week. That is based on terabytes of data processed each month."
"We find Databricks to be very expensive, although this improved when we found out how to shut it down at night."
"The price is okay. It's competitive."
"The price of Databricks is reasonable compared to other solutions."
"Databricks is a very expensive solution. Pricing is an area that could definitely be improved. They could provide a lower end compute and probably reduce the price."
"I'm not involved in the financing, but I can say that the solution seemed reasonably priced compared to the competitors. Similar products are usually in the same price range. With five being affordable and one being expensive, I would rate Databricks a four out of five."
"We acquired VAST Data as a one-time, capital purchase."
"Price-wise, VAST Data is not the cheapest, not the most expensive one."
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
814,763 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
16%
University
10%
Computer Software Company
9%
Manufacturing Company
8%
Financial Services Firm
16%
Computer Software Company
12%
Manufacturing Company
9%
Healthcare Company
6%
Computer Software Company
18%
Manufacturing Company
15%
Financial Services Firm
10%
Educational Organization
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about IBM SPSS Statistics?
The software offers consistency across multiple research projects helping us with predictive analytics capabilities.
What is your experience regarding pricing and costs for IBM SPSS Statistics?
While the pricing of the product may be higher, the accompanying service and features justify the investment. However...
What needs improvement with IBM SPSS Statistics?
In some cases, the product takes time to load a large dataset. They could improve this particular area.
Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designe...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analyti...
What do you like most about VAST Data?
The solution is useful for machine learning and scientific applications, including computer simulations.
What is your experience regarding pricing and costs for VAST Data?
Price-wise, VAST Data is not the cheapest, not the most expensive one.
What needs improvement with VAST Data?
The read/write ratio is an area in the solution with some flaws and needs improvement.
 

Also Known As

SPSS Statistics
Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
No data available
 

Learn More

Video not available
 

Overview

 

Sample Customers

LDB Group, RightShip, Tennessee Highway Patrol, Capgemini Consulting, TEAC Corporation, Ironside, nViso SA, Razorsight, Si.mobil, University Hospitals of Leicester, CROOZ Inc., GFS Fundraising Solutions, Nedbank Ltd., IDS-TILDA
Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Norwest Venture Partners, General Dynamics Information Technology, Ginkgo Bioworks
Find out what your peers are saying about Databricks, Knime, Microsoft and others in Data Science Platforms. Updated: October 2024.
814,763 professionals have used our research since 2012.