
Amazon SageMaker vs Databricks comparison

Comparison Buyer's Guide

Executive Summary
Updated on Oct 8, 2024
 

Categories and Ranking

IBM SPSS Statistics (Sponsored)
Ranking in Data Science Platforms: 10th
Average Rating: 8.0
Number of Reviews: 37
Ranking in other categories: Data Mining (3rd)

Amazon SageMaker
Ranking in Data Science Platforms: 5th
Average Rating: 7.8
Reviews Sentiment: 9.1
Number of Reviews: 29
Ranking in other categories: AI Development Platforms (4th)

Databricks
Ranking in Data Science Platforms: 1st
Average Rating: 8.2
Reviews Sentiment: 7.4
Number of Reviews: 84
Ranking in other categories: Streaming Analytics (1st)
 

Mindshare comparison

As of November 2024, in the Data Science Platforms category, the mindshare of IBM SPSS Statistics is 2.8%, up from 2.6% the previous year. The mindshare of Amazon SageMaker is 7.7%, down from 10.4% the previous year. The mindshare of Databricks is 19.1%, up marginally from the previous year (also 19.1% after rounding). Mindshare is calculated based on PeerSpot user engagement data.
 

Featured Reviews

AbakarAhmat - PeerSpot reviewer
Sep 21, 2023
Enhancing survey analysis that provides valued insightfulness
I use it to analyze questionnaire surveys related to a product, solution, or application, such as open data services, which I provide to consumers and end-users. These surveys contain evaluation assessments, and I use SPSS to analyze the responses. The most valuable feature is its robust…
Natu Lauchande - PeerSpot reviewer
Feb 27, 2024
Easy to use and manage, but the documentation does not have a lot of information
We use the product for deploying machine learning models. We use it for the machine learning model development process. We're currently implementing a project on a cross-selling model. It is like a standard XGBoost model. I’m evaluating the tool to see whether it will improve the workflow.…
Dunstan Matekenya - PeerSpot reviewer
Jul 10, 2024
Processes large-scale data sets and integrates with Apache Spark in a notebook environment
I primarily use Databricks to process large-scale data sets with Apache Spark. My main use case is processing large data sets, such as 600 GB or 800 GB. Databricks integrates natively with Apache Spark, which I use as a processing engine for large-scale datasets. This native integration is one of…

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"IBM SPSS Statistics depends on AI."
"It offers very good visualization."
"In terms of simplicity, I think the SPSS basic can handle it."
"The most valuable features are the solution is easy to use, training new users is not difficult, and our usage is comprehensive because the whole service is beneficial."
"It is perfectly adequate if all you need are the results and not the trail of evidence."
"The most valuable feature is the user interface because you don't need to write code."
"SPSS is quite robust and quicker in terms of providing you the output."
"SPSS can handle whatever you throw at it, whether your data set contains 10,000, 100,000, or a million objects. It's like the heavy artillery of analytical tools."
"The most valuable feature of Amazon SageMaker is SageMaker Studio."
"The most valuable features are the ability to store artifacts and gather reports and measures from experiments."
"The tool's most valuable feature, in my experience, is hyperparameter tuning. It allows us to test different parameters for the same model in parallel, which helps us quickly identify the configuration that yields the highest accuracy. This parallel computing capability saves us a lot of time."
"We've had no problems with SageMaker's stability."
"I have contacted the solution's technical support, and they were really good. I rate the technical support a ten out of ten."
"The solution is easy to scale...The documentation and online community support have been sufficient for us so far."
"It's user-friendly for business teams as they can understand many aspects through the AWS interface."
"The few projects we have done have been promising."
"I like the ability to use workspaces with other colleagues because you can work together even without seeing the other team's job."
"Databricks is a robust solution for big data processing, offering flexibility and powerful features."
"The built-in optimization recommendations halved query times and allowed us to reach decision points and deliver insights very quickly."
"Ability to work collaboratively without having to worry about the infrastructure."
"This solution offers a lake house data concept that we have found exciting. We are able to have a large amount of data in a data lake and can manage all relational activities."
"Databricks has improved my organization by allowing us to transform data from sources to a different format and feed that to the analytics, business intelligence, and reporting teams. This tool makes it easy to do those kinds of things."
"Databricks helps crunch petabytes of data in a very short period of time."
"Databricks has helped us have a good presence in data."
 

Cons

"The design of the experience can be improved."
"SPSS is a tool that's been around since the late 60s, and it's the universal worldwide standard for quantitative social science data analysis. That said, it does seem a bit strange to me that the graphical output functions are so clunky after all these years. The output of charts and graphs that SPSS produces is hideous."
"It would be helpful if there was better documentation on how to properly use the solution. A beginner's guide on how to use the various programming functions within the product would be so useful to a lot of people. I found that everything was very confusing at first. Having clear documentation would help alleviate that."
"Improvements are needed in the user interface, particularly in terms of user-friendliness."
"The solution needs more planning tools and capabilities."
"It could provide even more in the way of automation as there are many opportunities."
"Most of the package will give you the fixed value, or the p-value, without an explanation as to whether it is significant or not. Some beginners might need not just the results, but also some explanation for them."
"The solution needs to improve forecasting using time series analysis."
"The documentation must be made clearer and more user-friendly."
"The model repository is a concern as models are stored on a bucket and there's an issue with versioning."
"I had to create custom templates for labeling multi-data sets, such as text and images, which was time-consuming."
"There are other better solutions for large data, such as Databricks."
"SageMaker would be improved with the addition of reporting services."
"When starting a new session, the waiting time can be quite long, ranging from two to five minutes."
"Amazon SageMaker could improve in the area of hyperparameter tuning by offering more automated suggestions and tips during the tuning process."
"The user interface (UI) and user experience (UX) of SageMaker and AWS, in general, need improvement as they are not intuitive and require substantial time to learn how to use specific services."
"While Databricks is generally a robust solution, I have noticed a limitation with debugging in the Delta Live Table, which could be improved."
"Databricks requires writing code in Python or SQL, so if you're a good programmer then you can use Databricks."
"Databricks doesn't offer the use of Python scripts by itself and is not connected to GitHub repositories or anything similar. This is something that is missing. If they could integrate with Git tools, it would be an advantage."
"Support for Microsoft technology and the compatibility with the .NET framework is somewhat missing."
"The tool should improve its integration with other products."
"The biggest problem associated with the product is that it is quite pricey."
"I think setting up the whole account for one person and giving access are areas that can be difficult to manage and should be made a little easier."
"This solution only supports queries in SQL and Python, which is a bit limiting."
 

Pricing and Cost Advice

"We think that IBM SPSS is expensive for this function."
"More affordable training for new staff members."
"SPSS is an expensive piece of software because it's incredibly complex and has been refined over decades, but I would say it's fairly priced."
"If it requires a lot of data processing, maybe switching to IBM SPSS Clementine would be better for the buyer."
"The pricing of the modeler is high and can reduce the utility of the product for those who can not afford to adopt it."
"The price of IBM SPSS Statistics could improve."
"While the pricing of the product may be higher, the accompanying service and features justify the investment."
"I rate the tool's pricing a five out of ten."
"The solution offers a pay-as-you-go pricing model. The cost depends on the instance that you use."
"You don't pay for Sagemaker. You only pay for the compute instances in your storage."
"The pricing could be better, especially for querying. The per-query model feels expensive."
"The tool's pricing is reasonable."
"In terms of pricing, I'd also rate it ten out of ten because it's been beneficial compared to other solutions."
"I rate the pricing a five on a scale of one to ten, where one is the lowest price, and ten is the highest price. The solution is priced reasonably. There is no additional cost to be paid in excess of the standard licensing fees."
"The product is expensive."
"I would rate the solution's price a ten out of ten since it is very high."
"The price is okay. It's competitive."
"Databricks is a very expensive solution. Pricing is an area that could definitely be improved. They could provide a lower end compute and probably reduce the price."
"Databricks' cost could be improved."
"The billing of Databricks can be difficult and should improve."
"Databricks is not costly when compared with other solutions' prices."
"The licensing costs of Databricks depend on how many licenses we need, and Databricks provides a lot of discounts depending on that number."
"The cost is around $600,000 for 50 users."
"My smallest project is around a hundred euros, and my most expensive is just under a thousand euros a week. That is based on terabytes of data processed each month."
815,854 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 17%
University: 9%
Computer Software Company: 9%
Manufacturing Company: 8%

Financial Services Firm: 18%
Educational Organization: 14%
Computer Software Company: 11%
Manufacturing Company: 8%

Financial Services Firm: 16%
Computer Software Company: 12%
Manufacturing Company: 9%
Healthcare Company: 6%
 

Company Size

By reviewers: Large Enterprise, Midsize Enterprise, Small Business
 

Questions from the Community

What do you like most about IBM SPSS Statistics?
The software offers consistency across multiple research projects helping us with predictive analytics capabilities.
What is your experience regarding pricing and costs for IBM SPSS Statistics?
While the pricing of the product may be higher, the accompanying service and features justify the investment. However...
What needs improvement with IBM SPSS Statistics?
In some cases, the product takes time to load a large dataset. They could improve this particular area.
How would you compare Databricks vs Amazon SageMaker?
We researched AWS SageMaker, but in the end, we chose Databricks. Databricks is a Unified Analytics Platform designe...
What do you like most about Amazon SageMaker?
We've had experience with unique ML projects using SageMaker. For example, we're developing a platform similar to Cha...
What is your experience regarding pricing and costs for Amazon SageMaker?
The license cost for Amazon SageMaker ranges from seven thousand to fifteen thousand dollars per month depending o...
Which do you prefer - Databricks or Azure Machine Learning Studio?
Databricks gives you the option of working with several different languages, such as SQL, R, Scala, Apache Spark, or ...
Which would you choose - Databricks or Azure Stream Analytics?
Databricks is an easy-to-set-up and versatile tool for data management, analysis, and business analytics. For analyti...
What do you like most about Databricks?
Databricks is hosted on the cloud. It is very easy to collaborate with other team members who are working on it. It i...
 

Also Known As

IBM SPSS Statistics: SPSS Statistics
Amazon SageMaker: AWS SageMaker, SageMaker
Databricks: Databricks Unified Analytics, Databricks Unified Analytics Platform, Redash
 

Sample Customers

LDB Group, RightShip, Tennessee Highway Patrol, Capgemini Consulting, TEAC Corporation, Ironside, nViso SA, Razorsight, Si.mobil, University Hospitals of Leicester, CROOZ Inc., GFS Fundraising Solutions, Nedbank Ltd., IDS-TILDA
DigitalGlobe, Thomson Reuters Center for AI and Cognitive Computing, Hotels.com, GE Healthcare, Tinder, Intuit
Elsevier, MyFitnessPal, Sharethrough, Automatic Labs, Celtra, Radius Intelligence, Yesware
Find out what your peers are saying about Amazon SageMaker vs. Databricks and other solutions. Updated: October 2024.