I use Hadoop as a data lake in an AI/ML solution, where it connects to various data sources and ingests their data into Hadoop. It is used to process large data volumes from sources such as RDBMSs, file systems, Kafka for real-time streaming, IoT devices, WebSockets, and API metadata.
We used the product primarily for data analysis and storage. It helps handle large data sets, performing tasks like filtering, sorting, and joining. The platform is useful for data warehousing and provides distributed coordination and synchronization functionalities.
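As a rough illustration of the kind of filtering, sorting, and joining described above, here is a minimal PySpark sketch; the table and column names (orders, customers, amount, and so on) are hypothetical placeholders, not taken from the reviewer's environment.

```python
# Minimal PySpark sketch of filtering, sorting, and joining large data sets.
# Table and column names are hypothetical placeholders.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = (SparkSession.builder
         .appName("filter-sort-join-example")
         .enableHiveSupport()
         .getOrCreate())

orders = spark.table("orders")        # assumed Hive table
customers = spark.table("customers")  # assumed Hive table

result = (orders
          .filter(F.col("amount") > 1000)        # filtering
          .join(customers, on="customer_id")     # joining
          .orderBy(F.col("order_date").desc()))  # sorting

result.write.mode("overwrite").saveAsTable("high_value_orders")
```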
Our use case is a customer in Senegal who wants to migrate their Oracle data warehouse to Hadoop. I'm trying to migrate it to Hive or HBase. They're choosing between upgrading Oracle and moving to Cloudera Hadoop, and they seem to prefer Cloudera. The current data warehouse runs on Oracle DB, but we have to migrate the analytics processing to Hadoop.
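For reference, one possible way such an Oracle-to-Hive copy could look is sketched below using Spark's JDBC reader; the host, credentials, and table names are placeholders, and a real migration of large tables would more likely use Sqoop or partitioned JDBC reads.

```python
# Rough sketch of copying one Oracle table into Hive with Spark's JDBC reader.
# Connection details and table names are placeholders.
from pyspark.sql import SparkSession

spark = (SparkSession.builder
         .appName("oracle-to-hive")
         .enableHiveSupport()
         .getOrCreate())

dw_table = (spark.read.format("jdbc")
            .option("url", "jdbc:oracle:thin:@//oracle-host:1521/DWH")  # placeholder
            .option("dbtable", "SALES.FACT_SALES")                      # placeholder
            .option("user", "etl_user")
            .option("password", "***")
            .option("driver", "oracle.jdbc.OracleDriver")
            .load())

# Land the data in a Hive table (Parquet by default); HBase would need a different sink.
dw_table.write.mode("overwrite").saveAsTable("dwh.fact_sales")
```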
I use the solution in my company because it makes analytical processing easy. It takes data into one cluster and then processes it. Whatever analytics I run, on GPUs or otherwise, and whatever insight I get from the data, I can say that the processing is very fast.
I use the solution in my company for security purposes. We have intranet portals that we need to ensure are not accessible by outsiders. All the data within the internal applications is only accessible with valid credentials within the domain. In general, my company uses Apache Hadoop to secure our internal applications.
We use the Hadoop File System. We usually keep the data for our tables or big data on it. Hadoop has a query engine called Hive. We write SQL queries, and the tool usually processes in a parallel environment and gets us the data on Hive.
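A minimal sketch of what such a Hive-style query might look like when submitted through Spark's Hive support is below; the payments table and its columns are assumed purely for illustration.

```python
# Sketch of running a Hive/SQL query against data stored on HDFS.
# Table and column names are made up for illustration.
from pyspark.sql import SparkSession

spark = (SparkSession.builder
         .appName("hive-query-example")
         .enableHiveSupport()
         .getOrCreate())

# The engine splits this query into tasks that run in parallel across the cluster.
daily_totals = spark.sql("""
    SELECT   txn_date,
             SUM(amount) AS total_amount
    FROM     payments            -- assumed Hive table backed by HDFS files
    WHERE    txn_date >= '2024-01-01'
    GROUP BY txn_date
    ORDER BY txn_date
""")

daily_totals.show()
```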
I have been using the latest version of Apache Hadoop. It is a file system for data collection. The nodes in the cluster contain all the information, directories, and other files, and the nodes are backed by a MySQL database.
This solution is used for a variety of purposes, including managing enterprise data hubs, monitoring network quality, implementing an anti-fraud system, and building a data pipeline (conveyor) system.
We use the Apache Hadoop environment for big data engineering use cases. We have many applications, such as collecting, transforming, loading, and storing log event data for large organizations.
We use the solution as a data lake for our customer payment and SaaS information. We get data from various sources and then utilize and leverage that data.
I'm from the data governance team, and this is how my team uses Apache Hadoop: there's a GUI called Apache Atlas, which has a feature called the "business glossary". My team uses the business glossary from Apache Atlas and also uses Apache Ranger, another GUI where you can check who is using which data source through the Apache Hadoop platform. My team also uses the Apache Hadoop platform for AI-related use cases: the data required for any AI use case is processed with ETL, specifically with the Talend tool. My team then loads the data into Apache Hadoop, organizes it into clusters, and uses it for AI/ML cases.
Partner at a tech services company with 11-50 employees
Real User
Oct 5, 2021
There are several use cases for Hadoop. Sometimes it's used for data warehousing; other times, it's analytics. In some cases, it's used for transformation. For example, I have one client using it to decompress, compress, or encrypt data on ingestion, so they use it like an ETL engine.
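A hedged sketch of that ETL-on-ingestion pattern is below: it reads gzip-compressed text (which Spark decompresses transparently) and rewrites it as Snappy-compressed Parquet. The HDFS paths are placeholders, and encryption would typically be handled by HDFS encryption zones on the target directory rather than inside the job.

```python
# Sketch of the "ETL engine" pattern: ingest gzip-compressed text files,
# then rewrite them as Snappy-compressed Parquet on HDFS. Paths are placeholders.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("ingest-recompress").getOrCreate()

# Spark decompresses .gz input transparently while reading.
raw = spark.read.text("hdfs:///landing/raw_events/*.gz")

# Re-compress on write; encryption would normally come from HDFS encryption
# zones on the destination directory, not from this job.
(raw.write
    .option("compression", "snappy")
    .mode("overwrite")
    .parquet("hdfs:///warehouse/raw_events_parquet"))
```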
Founder & CTO at a tech services company with 1-10 employees
Real User
Dec 8, 2020
We mainly use Apache Hadoop for real-time streaming and integration, using Spark Streaming and the ecosystem of Spark technologies inside Hadoop.
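A minimal Structured Streaming sketch along those lines is below, reading events from Kafka and appending them to HDFS as Parquet; the broker address, topic, and paths are placeholders, and the job assumes the spark-sql-kafka package is on the classpath.

```python
# Minimal Structured Streaming sketch: read events from Kafka and append them
# to HDFS as Parquet. Broker, topic, and paths are placeholders.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("kafka-to-hdfs").getOrCreate()

events = (spark.readStream
          .format("kafka")
          .option("kafka.bootstrap.servers", "broker1:9092")  # placeholder broker
          .option("subscribe", "payments")                     # placeholder topic
          .load()
          .selectExpr("CAST(key AS STRING)", "CAST(value AS STRING)"))

query = (events.writeStream
         .format("parquet")
         .option("path", "hdfs:///data/streams/payments")
         .option("checkpointLocation", "hdfs:///checkpoints/payments")
         .outputMode("append")
         .start())

query.awaitTermination()
```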
Vice President - Finance & IT at a consumer goods company with 1-10 employees
Real User
Jul 14, 2020
As an example of a use case, when I was a contractor for Cisco, we were processing mobile network data and the volume was too big; an RDBMS couldn't support it. We started using the Hadoop framework to improve the process and get the results faster.
We are primarily dumping all the prior payment transaction data into a Hadoop-based system, and then we use some plug-and-play analytics tools to translate it.
The solution helps to store and retrieve information.
We use it to store data. Our team then takes this data to create reports on top of that.
We work on Apache Hadoop for various customers.
We use Apache Hadoop for analytics purposes.
We used Apache Hadoop mainly for ETL and data analysis.
The primary use is as a data lake.
We primarily use the solution for the enterprise data hub and big data warehouse extension.
The primary use case of this solution is data engineering and data files. The deployment model we are using is private, on-premises.
We primarily use this product to integrate legacy systems.
We use this solution for our Enterprise Data Lake.
We use it as a data lake for streaming analytical dashboards.