Hi community,
I'm aware that we can use Apache Spark with/without Hadoop.
But I believe the majority of people use Apache Spark together with Hadoop, and I read an article stating that Apache Spark without Hadoop is not suitable for production deployment and is only usable in a development environment.
Is that true?
I'd greatly appreciate it if anyone could elaborate on this.
Thanks.
I don't think using Apache Spark without Hadoop has any major drawbacks or issues. I have used Apache Spark quite successfully with AWS S3 on many batch-based projects. That said, for very high-performance systems, HDFS is the better option.
The main problem with running Apache Spark against object storage like S3 has been the weaker consistency guarantees of these object stores compared to HDFS. The post below explains the issue and how to avoid it. Hope this helps.
https://arnon.me/2015/08/spark...
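To make this concrete: for batch jobs reading and writing S3, Spark just needs to be pointed at the S3A connector. Below is a minimal `spark-defaults.conf` sketch; the credential values are placeholders, and the committer settings assume Hadoop 3.x plus Spark's optional `spark-hadoop-cloud` module (which avoids the rename-based output commit that is unsafe on object stores):

```properties
# Use the S3A filesystem implementation for s3a:// paths
spark.hadoop.fs.s3a.impl                 org.apache.hadoop.fs.s3a.S3AFileSystem

# Placeholder credentials; prefer IAM roles or a credentials provider in production
spark.hadoop.fs.s3a.access.key           YOUR_ACCESS_KEY
spark.hadoop.fs.s3a.secret.key           YOUR_SECRET_KEY

# S3A committers (requires the spark-hadoop-cloud module on the classpath)
spark.hadoop.fs.s3a.committer.name       directory
spark.sql.sources.commitProtocolClass    org.apache.spark.internal.io.cloud.PathOutputCommitProtocol
```

With this in place you can read/write paths like `s3a://your-bucket/data/` directly, no HDFS cluster involved.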
I mean, we can configure Spark without a Hadoop cluster as well, for example by using winutils.exe on Windows. Is that recommended for deployment? I'd also like to understand the difference between running Spark on a Hadoop environment versus running Spark without Hadoop.
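For context, what I mean by the winutils.exe setup is something like the following on a Windows development machine (the `C:\hadoop` path is just an example location):

```shell
rem Assumption: winutils.exe (matching your Hadoop build) has been placed in C:\hadoop\bin
set HADOOP_HOME=C:\hadoop
set PATH=%HADOOP_HOME%\bin;%PATH%
rem After this, spark-shell / spark-submit run locally with no Hadoop cluster
```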
Can you elaborate on the information you've been told about how using Apache Spark without Hadoop isn't good for deployment?
This insight would help many of our users.