Try our new research platform with insights from 80,000+ expert users

Databricks vs VAST Data comparison

Sponsored
 

Comparison Buyer's Guide

Executive SummaryUpdated on Oct 8, 2024
 

Categories and Ranking

IBM SPSS Statistics
Sponsored
Average Rating
8.0
Number of Reviews
37
Ranking in other categories
Data Mining (3rd), Data Science Platforms (9th)
Databricks
Average Rating
8.2
Reviews Sentiment
7.4
Number of Reviews
84
Ranking in other categories
Data Science Platforms (1st), Streaming Analytics (1st)
VAST Data
Average Rating
10.0
Reviews Sentiment
7.5
Number of Reviews
2
Ranking in other categories
All-Flash Storage (20th), File and Object Storage (8th), NVMe All-Flash Storage Arrays (8th)
 

Mindshare comparison

Data Science Platforms
NVMe All-Flash Storage Arrays
 

Featured Reviews

Md Masudul Hassan - PeerSpot reviewer
Comprehensive data analysis capabilities with a user-friendly interface, providing an efficient and reliable platform for researchers and analysts
I believe that offering short-term SPSS licenses, perhaps when customer sourcing is available, could make it more affordable. These licenses shouldn't include features tailored for universities or large sales organizations. Instead, they could offer discounts or additional facilities for smaller entities to access the software. In developing countries, it would be beneficial to provide certain features to users at no cost initially, while also customizing pricing options. For example, offering basic features to the first hundred users can help them become familiar with the software and its capabilities. This approach encourages users to upgrade to higher tiers as they become more experienced and require additional functionality.
Dunstan Matekenya - PeerSpot reviewer
Process large-scale data sets and integrates with Apache Spark with notebook environment
Databricks integrates natively with Apache Spark, which I use as a processing engine for large-scale datasets. This native integration is one of its strengths. Another strength is that the platform makes it very easy to manage resources. For example, setting up a cluster of five or fifteen nodes is straightforward with Databricks. The notebook environment is also excellent, making it easy to perform various tasks.
Alan Powers - PeerSpot reviewer
Stability-wise, a device that has been up and running for years
The failover capability and resiliency are some of the solution's valuable features. The big thing is resilience because it has richer coding in it, so multiple devices can't fail. Also, one can still access a number of CBoxes that can allow one to access their file system. Once a device fails, it fails the transparency of the end-user, and it just starts using another resource. The encryption capability, the snapshots, along with a whole bunch of features make the tool valuable. VAST Data keeps adding more and more features all the time.

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"They have many existing algorithms that we can use and use effectively to analyze and understand how to put our data to work to improve what we do."
"The learning curve to using this product is not steep. The program is appropriate for those who do not have a lot of background in programming, yet have to perform basic statistical analysis."
"The solution is very comprehensive, especially compared to Minitabs, which is considered more for manufacturing. However, whatever data you want to analyze can be handled with SPSS."
"The solution has numerous valuable features. We particularly like custom tabs. It's very useful. We end up analyzing a lot of software data, so features related to custom tabs are really helpful."
"The most valuable features are the small learning curve and its ability to hold a lot of data."
"It offers very good visualization."
"In terms of the features I've found most valuable, I'd say the duration, the correlation, and of course the nonparametric statistics. I use it for reliability and survival analysis, time series, regression models in different solutions, and different types of solutions."
"I've found the descriptive statistics and cross-tabs valuable. The very simple correlations and regressions are as well."
"The solution is very easy to use."
"The solution's features are fantastic and include interactive clusters that perform at top speed when compared to other solutions."
"It's great technology."
"It can send out large data amounts."
"Databricks' most valuable features are the workspace and notebooks. Its integration, interface, and documentation are also good."
"Databricks is a unified solution that we can use for streaming. It is supporting open source languages, which are cloud-agnostic. When I do database coding if any other tool has a similar language pack to Excel or SQL, I can use the same knowledge, limiting the need to learn new things. It supports a lot of Python libraries where I can use some very easily."
"The ability to stream data and the windowing feature are valuable."
"The tool helps with data processing and analytics with large-scale data or big data since it is associated with managing data at a large scale."
"This has been one of the most reliable storage systems that I have ever used."
"The solution is useful for machine learning and scientific applications, including computer simulations."
 

Cons

"If there is any self-generation data collection plan (DCP), it would be helpful in gathering data. It would also be useful if there is a function to scale it up to, let's say, UiPath and have it consolidate and integrate into a UiPath solution."
"I know that SPSS is a statistical tool but it should also include a little bit of analytical behavior. You can call it augmented analysis or predictive analysis. The bottom line is it should have more graphical and analytical capabilities."
"I would like SPSS to improve its integration with other data-filing IBM tools. I also think its duration with data, utilization, and graphics could be better."
"The statistics should be more self-explanatory with detailed automated reports."
"IBM SPSS Statistics could improve the visual outputs where you are producing, for example, a graph for a company board of directors, or an advert."
"The product should provide more ways to import data and export results that are user-friendly for high-level executives."
"It could provide even more in the way of automation as there are many opportunities."
"I feel that when it comes to conducting multiple analyses, there could be more detailed information provided. Currently, the software gives a summary and an overview, but it would be beneficial to have specific details for each product or variable."
"It's not easy to use, and they need a better UI."
"There is room for improvement in the documentation of processes and how it works."
"The ability to customize our own pipelines would enhance the product, similar to what's possible using ML files in Microsoft Azure DevOps."
"Databricks requires writing code in Python or SQL, so if you're a good programmer then you can use Databricks."
"I have had some issues with some of the Spark clusters running on Databricks, where the Spark runtime and clusters go up and down, which is an area for improvement."
"The biggest problem associated with the product is that it is quite pricey."
"A lot of people are required to manage this solution."
"The integration features could be more interesting, more involved."
"The read/write ratio is an area in the solution with some flaws and needs improvement."
"The write performance could be improved because it is less than half of the read performance."
 

Pricing and Cost Advice

"Our licence is on a yearly renewal basis. While pricing is not the primary concern in our evaluation, as products are assessed by whether they can meet our user needs and expertise, the cost can be a limiting factor in the number of licences we procure."
"More affordable training for new staff members."
"We think that IBM SPSS is expensive for this function."
"SPSS is an expensive piece of software because it's incredibly complex and has been refined over decades, but I would say it's fairly priced."
"While the pricing of the product may be higher, the accompanying service and features justify the investment."
"If it requires lot of data processing, maybe switching to IBM SPSS Clementine would be better for the buyer."
"The pricing of the modeler is high and can reduce the utility of the product for those who can not afford to adopt it."
"It's quite expensive, but they do a special deal for universities."
"We pay as we go, so there isn't a fixed price. It's charged by the unit. I don't have any details detail about how they measure this, but it should be a mix between processing and quantity of data handled. We run a simulation based on our use cases, which gives us an estimate. We've been monitoring this, and the costs have met our expectations."
"There are different versions."
"We implement this solution on behalf of our customers who have their own Azure subscription and they pay for Databricks themselves. The pricing is more expensive if you have large volumes of data."
"The basic version of this solution is now open-source, so there are no license costs involved. However, there is a charge for any advanced functionality and this can be quite expensive."
"We find Databricks to be very expensive, although this improved when we found out how to shut it down at night."
"I would rate Databricks' pricing seven out of ten."
"The solution requires a subscription."
"The price of Databricks is reasonable compared to other solutions."
"Price-wise, VAST Data is not the cheapest, not the most expensive one."
"We acquired VAST Data as a one-time, capital purchase."
report
Use our free recommendation engine to learn which Data Science Platforms solutions are best for your needs.
817,354 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
17%
Computer Software Company
9%
University
9%
Manufacturing Company
8%
Financial Services Firm
16%
Computer Software Company
11%
Manufacturing Company
9%
Healthcare Company
6%
Computer Software Company
17%
Manufacturing Company
15%
Financial Services Firm
10%
University
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
No data available
 

Questions from the Community

What do you like most about IBM SPSS Statistics?
The software offers consistency across multiple research projects helping us with predictive analytics capabilities.
What is your experience regarding pricing and costs for IBM SPSS Statistics?
The cost of IBM SPSS Statistics is managed by organizations, not individual researchers. It is a very expensive produ...
What needs improvement with IBM SPSS Statistics?
IBM SPSS Statistics does not keep you close to your data like KNIME. In KNIME, at every stage, you can see the result...
Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or ...
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designe...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analyti...
What do you like most about VAST Data?
The solution is useful for machine learning and scientific applications, including computer simulations.
What is your experience regarding pricing and costs for VAST Data?
Price-wise, VAST Data is not the cheapest, not the most expensive one.
What needs improvement with VAST Data?
The read/write ratio is an area in the solution with some flaws and needs improvement.
 

Also Known As

SPSS Statistics
Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
No data available
 

Learn More

Video not available
 

Overview

 

Sample Customers

LDB Group, RightShip, Tennessee Highway Patrol, Capgemini Consulting, TEAC Corporation, Ironside, nViso SA, Razorsight, Si.mobil, University Hospitals of Leicester, CROOZ Inc., GFS Fundraising Solutions, Nedbank Ltd., IDS-TILDA
Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Norwest Venture Partners, General Dynamics Information Technology, Ginkgo Bioworks
Find out what your peers are saying about Databricks, Knime, Amazon Web Services (AWS) and others in Data Science Platforms. Updated: November 2024.
817,354 professionals have used our research since 2012.