Try our new research platform with insights from 80,000+ expert users

Apache Spark vs Zadara comparison

 

Comparison Buyer's Guide

Executive Summary

Review summaries and opinions

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Categories and Ranking

Apache Spark
Ranking in Compute Service
4th
Average Rating
8.4
Reviews Sentiment
7.7
Number of Reviews
65
Ranking in other categories
Hadoop (1st), Java Frameworks (2nd)
Zadara
Ranking in Compute Service
10th
Average Rating
8.6
Reviews Sentiment
7.6
Number of Reviews
12
Ranking in other categories
All-Flash Storage (33rd), Software Defined Storage (SDS) (16th), Public Cloud Storage Services (15th), File and Object Storage (22nd)
 

Mindshare comparison

As of April 2025, in the Compute Service category, the mindshare of Apache Spark is 11.2%, up from 9.7% compared to the previous year. The mindshare of Zadara is 0.7%, up from 0.2% compared to the previous year. It is calculated based on PeerSpot user engagement data.
Compute Service
 

Featured Reviews

Ilya Afanasyev - PeerSpot reviewer
Reliable, able to expand, and handle large amounts of data well
We use batch processing. It works well with our formats and file versions. There's a lot of functionality. In our pipeline each hour, we make a copy of data from MongoDB, of the changes from MongoDB to some specific file. Each time pipeline copied all of the data, it would do it each time without changes to all of the tables. Tables have a lot of data, and in the last MongoDB version, there is a possibility to read only changed data. This reduced the cost and configuration of the cluster, and we saved about $150,000. The solution is scalable. It's a stable product.
Kirubel Behailu - PeerSpot reviewer
Enhancing storage management efficiency with user-friendly experience
Our customers are using Zadara for their research and development environments. We provide infrastructure for government projects, but we are often not fully aware of their specific usage.  I typically use it for our infrastructure and offer both Zadara and Microsoft Azure to our customers Zadara…

Quotes from Members

We asked business professionals to review the solutions they use. Here are some excerpts of what they said:
 

Pros

"AI libraries are the most valuable. They provide extensibility and usability. Spark has a lot of connectors, which is a very important and useful feature for AI. You need to connect a lot of points for AI, and you have to get data from those systems. Connectors are very wide in Spark. With a Spark cluster, you can get fast results, especially for AI."
"The main feature that we find valuable is that it is very fast."
"I found the solution stable. We haven't had any problems with it."
"Spark is used for transformations from large volumes of data, and it is usefully distributed."
"It is highly scalable, allowing you to efficiently work with extensive datasets that might be problematic to handle using traditional tools that are memory-constrained."
"The most significant advantage of Spark 3.0 is its support for DataFrame UDF Pandas UDF features."
"The most valuable feature of Apache Spark is its memory processing because it processes data over RAM rather than disk, which is much more efficient and fast."
"The product's deployment phase is easy."
"Zadara saves both time and money."
"It provides very satisfactory storage performance."
"One of the most valuable features is its integration with other cloud solutions. We have a presence within Amazon EC2 and we leverage compute instances in there. Being able to integrate with compute, both locally within Zadara, as well as with other cloud vendors such as Amazon, is very helpful, while also being able to maintain extremely low latency between those connections."
"A nice feature is the immutable object storage, which can be used in conjunction with Veeam."
"The processing is much faster with this product."
"It's very easy to expand and compared to other storage systems that we've used, it's a lot more expandable and a lot more flexible in how it's deployed."
"Zadara is a fully-fledged platform, and our customers are happy with its use."
"The most valuable feature is the flexibility in terms of deployment options."
 

Cons

"Include more machine learning algorithms and the ability to handle streaming of data versus micro batch processing."
"Spark could be improved by adding support for other open-source storage layers than Delta Lake."
"At the initial stage, the product provides no container logs to check the activity."
"It's not easy to install."
"It requires overcoming a significant learning curve due to its robust and feature-rich nature."
"The Spark solution could improve in scheduling tasks and managing dependencies."
"The initial setup was not easy."
"If you have a Spark session in the background, sometimes it's very hard to kill these sessions because of D allocation."
"There are still some storage features that they lack. For example, other vendors implemented the auto-tiering feature a long time ago, while Zadara Storage Cloud is just coming out with this feature today. So, they are a little bit late compared to the market."
"The management interface is more geared towards end-users rather than a service partner like ourselves, and there are improvements that can be made around that."
"I would like to see them be a little bit more proactive in terms of the patches and updates that are available. I would like to see more disclosure and information around what fixes or what enhancements are available within a patch, and help in coordinating and scheduling that. Right now, it's driven more by the customer in reaching out via a support ticket."
"The cost is not favorable as Zadara does not provide competitive rates for regions like South Africa and Ethiopia."
"The range of support of VMware could be better. It can support Windows, however, it cannot support other operating systems like IBM AIX. This needs to improve."
"Currently, when we do firmware upgrades, it sometimes causes issues and is not as nondisruptive as desired."
"Having iSCSI over the internet using a VPN, the IPSec tunnel is really the only thing that I find missing from this product."
"The initial installation was difficult because many steps required the command line interface (CLI). Maintenance can also be complicated, especially when deeper troubleshooting requires navigating the CLI and searching for logs."
 

Pricing and Cost Advice

"Apache Spark is not too cheap. You have to pay for hardware and Cloudera licenses. Of course, there is a solution with open source without Cloudera."
"They provide an open-source license for the on-premise version."
"Since we are using the Apache Spark version, not the data bricks version, it is an Apache license version, the support and resolution of the bug are actually late or delayed. The Apache license is free."
"Licensing costs can vary. For instance, when purchasing a virtual machine, you're asked if you want to take advantage of the hybrid benefit or if you prefer the license costs to be included upfront by the cloud service provider, such as Azure. If you choose the hybrid benefit, it indicates you already possess a license for the operating system and wish to avoid additional charges for that specific VM in Azure. This approach allows for a reduction in licensing costs, charging only for the service and associated resources."
"I did not pay anything when using the tool on cloud services, but I had to pay on the compute side. The tool is not expensive compared with the benefits it offers. I rate the price as an eight out of ten."
"On the cloud model can be expensive as it requires substantial resources for implementation, covering on-premises hardware, memory, and licensing."
"We are using the free version of the solution."
"It is quite expensive. In fact, it accounts for almost 50% of the cost of our entire project."
"The price of Zadara is very good and it covers everything. There is no subscription needed."
"The pricing and licensing are very simple and the cost is predictable, although, like everything that you pay for as you use, you have to be mindful of what you're using."
"It is a nice licensing model and it makes it quite simple because we just pay for what we use, and the bill that comes shows us exactly what customers are using what resources."
"The pricing is very competitive and the fact that they have very compelling discounts for multi-year commitments is great."
"If you just take the street price of Zadara Storage Cloud and look up the price or cost per hour, then you could think that Zadara Storage Cloud is extremely expensive or a solution only for enterprise use. That is not true. You need to compare the entire system. This means that you don't stop looking at just the street price, but you need to consider all the features, requirements, and costs of support as well as the extra cost that other vendors have. Other players just play with hidden, additional costs. Everything is included in Zadara Storage Cloud's licensing cost; what you get is what you pay for."
"One of the factors that ruled out several providers was cost. They were way too expensive for the volume of data that we needed and the speed at which we needed to be able to manage it. There aren't a lot of providers that can do that."
"For our use, it's appropriately priced and overall, it's proved to be very cost-effective against other tier-one vendors."
report
Use our free recommendation engine to learn which Compute Service solutions are best for your needs.
845,406 professionals have used our research since 2012.
 

Top Industries

By visitors reading reviews
Financial Services Firm
28%
Computer Software Company
13%
Manufacturing Company
8%
Comms Service Provider
5%
Computer Software Company
24%
Manufacturing Company
11%
Financial Services Firm
10%
Retailer
6%
 

Company Size

By reviewers
Large Enterprise
Midsize Enterprise
Small Business
 

Questions from the Community

What do you like most about Apache Spark?
We use Spark to process data from different data sources.
What is your experience regarding pricing and costs for Apache Spark?
Compared to other solutions like Doc DB, Spark is more costly due to the need for extensive infrastructure. It requires significant investment in infrastructure, which can be expensive. While cloud...
What needs improvement with Apache Spark?
The Spark solution could improve in scheduling tasks and managing dependencies. Spark alone cannot handle sequential tasks, requiring environments like Airflow scheduler or scripts. For instance, o...
What needs improvement with Zadara?
There is room for improvement in pricing as it is currently quite expensive. Adding AI capabilities could enhance the offering as well.
What is your primary use case for Zadara?
Our customers are using Zadara for their research and development environments. We provide infrastructure for government projects, but we are often not fully aware of their specific usage. I typica...
What advice do you have for others considering Zadara?
If companies have the budget, I recommend Zadara for small business classes due to its robust performance and user-friendliness. I'd rate the solution eight out of ten.
 

Comparisons

 

Overview

 

Sample Customers

NASA JPL, UC Berkeley AMPLab, Amazon, eBay, Yahoo!, UC Santa Cruz, TripAdvisor, Taboola, Agile Lab, Art.com, Baidu, Alibaba Taobao, EURECOM, Hitachi Solutions
Time, Inc. A&E Network, The Washington Post, News UK, McGraw Hill, Gilt, Toshiba, Deloitte, VMware
Find out what your peers are saying about Apache Spark vs. Zadara and other solutions. Updated: March 2025.
845,406 professionals have used our research since 2012.