One of Apache Spark's most valuable features is that it supports in-memory processing, so job execution is very fast compared to traditional tools.
Senior Solutions Architect at a retailer with 10,001+ employees
Mar 27, 2021
I like that it can handle multiple tasks in parallel. I also like the automation feature. The JavaScript support also helps with parallel streaming of the library.
Its scalability and speed are very valuable. You can scale it a lot. It is a great technology for big data. It is definitely better than a lot of earlier warehouse or pipeline solutions, such as Informatica.
Spark SQL is very compliant with normal SQL that we have been using over the years. This makes it easy to code in Spark. It is just like using normal SQL. You can use the APIs of Spark or you can directly write SQL code and run it. This is something that I feel is useful in Spark.
AI libraries are the most valuable. They provide extensibility and usability. Spark has a lot of connectors, which is a very important and useful feature for AI. You need to connect a lot of points for AI, and you have to get data from those systems. Connectors are very wide in Spark. With a Spark cluster, you can get fast results, especially for AI.
The memory processing engine is the solution's most valuable aspect. It processes everything extremely fast, and it's in the cluster itself. It acts as a memory engine and is very effective in processing data correctly.
I love every core functionality of Apache Spark. Initially, it provided only the basic RDD interface for processing data across a distributed cluster. It then evolved to the DataFrame and Dataset interfaces, with an optimized execution engine and more flexibility for developers to query the data.
Spark provides programmers with an application programming interface centered on a data structure called the resilient distributed dataset (RDD), a read-only multiset of data items distributed over a cluster of machines, that is maintained in a fault-tolerant way. It was developed in response to limitations in the MapReduce cluster computing paradigm, which forces a particular linear dataflow structure on distributed programs: MapReduce programs read input data from disk, map a function...
The product’s most valuable features are lazy evaluation and workload distribution.
The deployment of the product is easy.
We use Spark to process data from different data sources.
We use it for ETL purposes as well as for implementing the full transformation pipelines.
The most crucial feature for us is the streaming capability. It serves as a fundamental aspect that allows us to exert control over our operations.
The most valuable feature of Apache Spark is its flexibility.
The data processing framework is good.
The distribution of tasks, like the seamless map-reduce functionality, is quite impressive.
Provides a lot of good documentation compared to other solutions.
There's a lot of functionality.
The most valuable feature of Apache Spark is its ease of use.
Spark helps us reduce startup time for our customers and gives a very high ROI in the medium term.
Spark can handle small to huge data and is suitable for any size of company.
Apache Spark can do large volume interactive data analysis.
The solution has been very stable.
The processing time is very much improved over the data warehouse solution that we were using.
The features we find most valuable are the machine learning, data learning, and Spark Analytics.
The main feature that we find valuable is that it is very fast.
I feel the streaming is its best feature.
The solution is very stable.
The most valuable feature of this solution is its capacity for processing large amounts of data.
The scalability has been the most valuable aspect of the solution.
I found the solution stable. We haven't had any problems with it.
Features include machine learning, real time streaming, and data processing.