
Apache Spark vs SAP HANA comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

 

Categories and Ranking

Apache Spark
Average Rating: 8.4
Reviews Sentiment: 7.7
Number of Reviews: 65
Ranking in other categories: Hadoop (1st), Compute Service (4th), Java Frameworks (2nd)

SAP HANA
Average Rating: 8.2
Reviews Sentiment: 6.5
Number of Reviews: 84
Ranking in other categories: Data Virtualization (2nd), Embedded Database (4th), Relational Databases Tools (4th)
 

Featured Reviews

Ilya Afanasyev - PeerSpot reviewer
Reliable, able to expand, and handle large amounts of data well
We use batch processing. It works well with our formats and file versions, and there's a lot of functionality. Each hour, our pipeline copies changes from MongoDB to a specific file. Previously, the pipeline copied all of the data on every run, including tables that had not changed. The tables hold a lot of data, and the latest MongoDB version makes it possible to read only the changed data. This reduced the cost and configuration of the cluster, and we saved about $150,000. The solution is scalable. It's a stable product.
Jayarami Reddy Pujeri - PeerSpot reviewer
Comprehensive system with real-time analytics for versatile industry applications
Our primary use case is working with various clients in industries such as pharmaceuticals and other services. We support clients as implementers of SAP HANA, providing expertise in functionality, finance, logistics, and processes. The solution is very user-friendly and supports all kinds of…

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"Apache Spark is known for its ease of use. Compared to other available data processing frameworks, it is user-friendly."
"Spark is used for transformations of large volumes of data, and it is usefully distributed."
"The processing time is very much improved over the data warehouse solution that we were using."
"Apache Spark provides a very high-quality implementation of distributed data processing."
"It's easy to prepare parallelism in Spark, run the solution with specific parameters, and get good performance."
"Spark helps us reduce startup time for our customers and gives a very high ROI in the medium term."
"The most valuable feature of this solution is its capacity for processing large amounts of data."
"The most valuable feature of Apache Spark is its flexibility."
"Technically it resembles Oracle, but as a somewhat lighter version."
"SAP HANA's best features are its programmability and extensibility - you can size and shape the software however you need."
"We've had good experiences with technical support."
"Integration is the most valuable feature we use SAP HANA for."
"The most valuable feature of SAP HANA is its performance and integration."
"The most valuable features I have found are speed, dashboard, and reporting."
"The solution makes it easy to manage enterprise resources, and reporting and analytics are included. It is good for company growth, and all modules are managed well."
"We use SAP HANA for Master Data Governance."
 

Cons

"Stream processing needs to be developed more in Spark. I have used Flink previously. Flink is better than Spark at stream processing."
"At the initial stage, the product provides no container logs to check the activity."
"We are building our own queries on Spark, and it can be improved in terms of query handling."
"More ML-based algorithms should be added to make it algorithm-rich for developers."
"When you first start using this solution, it is common to run into memory errors when you are dealing with large amounts of data."
"We use big data manager but we cannot use it as conditional data so whenever we're trying to fetch the data, it takes a bit of time."
"The initial setup was not easy."
"Apache Spark lacks geospatial data."
"If the developers were to enhance or improve the application logic while processing the transactions, that would be great."
"The high price of the product is an area of concern where improvements are required."
"I think that the pricing is high and it needs improvement."
"What needs improvement in SAP HANA is its automation, in particular, it needs more enhancements in that area."
"The releases need to be more stable. It's surprising to still encounter significant bugs after ten years of the product being available."
"The relationship with a partner that sells SAP could be better. We depend much more on our own development, and the partner is for selling us the solutions, so we need them to be able to supply help and answers. The partner isn't very helpful, and we have to rely on our own knowledge and research."
"FI, or the financial module of SAP, has room for improvement. It has to have some better localization for the Middle East, especially in regards to taxes and the letter of credit cycle. I would like to see better localization from the HCM."
"I would like to see improvements in the connectivity of the solution with other BI software. Not every software can connect to it natively."
 

Pricing and Cost Advice

"Since we are using the Apache Spark version under the Apache license, not the Databricks version, support and bug resolution are late or delayed. The Apache license is free."
"Apache Spark is an expensive solution."
"It is an open-source platform. We do not pay for its subscription."
"We are using the free version of the solution."
"It is quite expensive. In fact, it accounts for almost 50% of the cost of our entire project."
"I did not pay anything when using the tool on cloud services, but I had to pay on the compute side. The tool is not expensive compared with the benefits it offers. I rate the price as an eight out of ten."
"Considering the product version used in my company, I feel that the tool is not costly since the product is available for free."
"Apache Spark is not too cheap. You have to pay for hardware and Cloudera licenses. Of course, there is a solution with open source without Cloudera."
"The pricing is a bit on the high side."
"SAP HANA is very expensive here in China. My company bought the SAP ERP suite, which includes SAP HANA. For others who use SAP HANA as an analytical database, CPU numbers will affect the pricing, so as a solution, it's costly."
"The price of this product is good."
"The tool is expensive compared to other products, like Oracle R or Microsoft, but it offers a range of all the company's functions."
"The solution's license is very expensive so I rate pricing a one out of ten."
"Price-wise, the product falls on the higher side of the spectrum. There is no need to pay for maintenance and support additionally. Support is available for bug fixes in the product."
"We pay annually for the license of the solution."
"It is expensive, which isn't a problem for us because SAP HANA is processing the data so fast."
842,145 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm: 27%
Computer Software Company: 13%
Manufacturing Company: 8%
Comms Service Provider: 5%

Manufacturing Company: 15%
Computer Software Company: 12%
Financial Services Firm: 10%
Government: 7%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Apache Spark?
We use Spark to process data from different data sources.
What is your experience regarding pricing and costs for Apache Spark?
Compared to other solutions like Doc DB, Spark is more costly due to the need for extensive infrastructure. It requires significant investment in infrastructure, which can be expensive. While cloud...
What needs improvement with Apache Spark?
The Spark solution could improve in scheduling tasks and managing dependencies. Spark alone cannot handle sequential tasks, requiring environments like Airflow scheduler or scripts. For instance, o...
What are the biggest benefits of using SAP HANA?
Based on my work with SAP HANA, the biggest benefit that it can bring to your business is total data management. This product is by SAP - a company that serves almost all needs a client may have co...
Is SAP HANA’s customer and technical support reliable?
We have been using SAP HANA for a fairly short period of time and have only taken advantage of their customer support. So far, we have not had issues that required specialized help from technical s...
Is SAP HANA difficult to set up and start using?
SAP HANA is fairly easy to set up, however, I do not think a complete beginner can do it. You certainly need some preparation - either you need to have experience with similar solutions, or with ot...
 


Also Known As

Apache Spark: No data available
SAP HANA: SAP High-Performance Analytic Appliance, HANA
 


Sample Customers

Apache Spark: NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
SAP HANA: Unilever, NHS 24, adidas Group, CHIO Aachen, Hamburg Port Authority (HPA), Bangkok Airways Public Company Limited
Find out what your peers are saying about Apache Spark vs. SAP HANA and other solutions. Updated: March 2025.