We use the product for scheduling and defining workflows. It helps us extensively to manage complex workflows within Cloudera's ecosystem, particularly for handling and processing data.
Director of Business Intelligence at a self-employed consultancy
Real User
Jun 20, 2024
We use the tool to schedule data pipelines. We also use Apache Airflow to orchestrate dbt, another data processing tool. Airflow helps manage dbt processes, which, in our case, load data from our data lake.
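As a rough illustration of this pattern, a minimal Airflow 2 style DAG that wraps dbt in BashOperator tasks might look like the sketch below; the DAG id, schedule, and dbt project paths are hypothetical placeholders, not the reviewer's actual setup.

from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator

# Hypothetical DAG id, schedule, and dbt project paths; adjust to the real project.
with DAG(
    dag_id="dbt_data_lake_load",
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    # Run the dbt models that load data from the data lake.
    dbt_run = BashOperator(
        task_id="dbt_run",
        bash_command="dbt run --project-dir /opt/dbt/project --profiles-dir /opt/dbt/profiles",
    )

    # Validate the transformed models once the run finishes.
    dbt_test = BashOperator(
        task_id="dbt_test",
        bash_command="dbt test --project-dir /opt/dbt/project --profiles-dir /opt/dbt/profiles",
    )

    dbt_run >> dbt_test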
I have used Apache Airflow for various purposes, such as orchestrating Spark jobs, EMR clusters, and Glue jobs, and submitting jobs within the DCP data flow on Azure Databricks, including ad hoc queries, for instance, when there's a need to execute queries on Redshift or other databases.
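A hedged sketch of that kind of mixed orchestration, using plain BashOperator calls to spark-submit and psql; the script path, the REDSHIFT_URL variable, and the SQL are placeholders, and dedicated provider operators (for EMR, Glue, Databricks, or Redshift) could be used instead of shell commands.

from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator

with DAG(
    dag_id="spark_and_adhoc_jobs",
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    # Submit a Spark job via the CLI; a provider operator could replace this.
    spark_job = BashOperator(
        task_id="spark_job",
        bash_command="spark-submit --master yarn /opt/jobs/process_events.py",
    )

    # Run an ad hoc query against the warehouse; REDSHIFT_URL is a hypothetical env var.
    adhoc_query = BashOperator(
        task_id="adhoc_query",
        bash_command="psql $REDSHIFT_URL -c 'ANALYZE sales;'",
    )

    spark_job >> adhoc_query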
We utilize Apache Airflow for two primary purposes. Firstly, it serves as the tool for ingesting data from the source system application into our data warehouse. Secondly, it plays a crucial role in our ETL pipeline. After extracting data, it facilitates the transformation process and subsequently loads the transformed data into the designated target tables.
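One way to express that extract-transform-load sequence is with Airflow's TaskFlow API; the sketch below uses placeholder functions and a hypothetical DAG name rather than the reviewer's actual pipeline.

from datetime import datetime

from airflow.decorators import dag, task


@dag(start_date=datetime(2024, 1, 1), schedule_interval="@daily", catchup=False)
def source_to_warehouse_etl():
    @task
    def extract():
        # Placeholder: pull rows from the source system application.
        return [{"id": 1, "amount": 100}]

    @task
    def transform(rows):
        # Placeholder: apply the business transformations.
        return [{**row, "amount_usd": row["amount"]} for row in rows]

    @task
    def load(rows):
        # Placeholder: write the transformed rows into the designated target tables.
        print(f"loading {len(rows)} rows")

    load(transform(extract()))


source_to_warehouse_etl()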
We use Apache Airflow for the automation and orchestration of model deployment, training, and feature engineering steps. It is a model lifecycle management tool.
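A minimal sketch of such a model lifecycle DAG, assuming the feature engineering, training, and deployment steps live in ordinary Python functions; the function bodies, DAG id, and weekly schedule are stand-ins for illustration.

from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator


# Stand-ins for real feature engineering, training, and deployment code.
def build_features():
    print("building features")


def train_model():
    print("training model")


def deploy_model():
    print("deploying model")


with DAG(
    dag_id="model_lifecycle",
    start_date=datetime(2024, 1, 1),
    schedule_interval="@weekly",
    catchup=False,
) as dag:
    features = PythonOperator(task_id="feature_engineering", python_callable=build_features)
    training = PythonOperator(task_id="train_model", python_callable=train_model)
    deployment = PythonOperator(task_id="deploy_model", python_callable=deploy_model)

    features >> training >> deployment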
Apache Airflow is like a freeway. Just as a freeway allows cars to travel quickly and efficiently from one point to another, Apache Airflow allows data engineers to orchestrate their workflows in a similarly efficient way. There are a lot of scheduling tools in the market, but Apache Airflow has taken over everything. With the help of Airflow operators, any task required for day-to-day data engineering work becomes possible. It manages the entire lifecycle of data engineering workflows.
CEO - Founder / Principal Data Scientist / Principal AI Architect at Kanayma LLC
Real User
Dec 1, 2022
Our primary use case for the solution is setting up workflows and processes applied everywhere because most industries are based on workflows and processes. We've deployed it for all kinds of workflows within the organization.
Senior Data Analytics at a media company with 1,001-5,000 employees
Real User
Aug 31, 2022
Our primary use case for this solution is scheduling tasks. We capture the data from the SQL Server location and migrate it to the central data warehouse.
Currently, I am a lead data scientist. Our primary use cases for Apache Airflow are for all orchestrations, from the basic big data lake to machine learning predictions. It is used for all the ML processes. It is also used for some ELT, to transform, load, and export all big data across restricted, unrestricted, and all processing phases.
Senior Software Engineer at a pharma/biotech company with 1,001-5,000 employees
Real User
Oct 5, 2021
I've used it at past companies to build a data warehouse for analytics (populating Redshift/Snowflake).
My current company is using it for similar purposes, but more to pull data from data sources across the company, join them into a central data repository (we're currently using Postgres), and build datasets with this data. Having this central data repository will help serve other use cases for us in the future.
Senior Software Engineer at a pharma/biotech company with 1,001-5,000 employees
Real User
Mar 26, 2021
I'm a data engineer. In the past, I used Airflow for building data pipelines and to populate data warehouses. With my current company, it's a data product or datasets that we sell to biopharma companies. We are using those pipelines to generate those datasets.
There are a few use cases we have for Apache Airflow, one being government projects where we perform data operations on a monthly basis. For example, we'll collect data from various agencies, harmonize the data, and then produce a dashboard. In general, it's a BI use case, but focused on the social economy. We concentrate mainly on BI, and because my team members have strong technical backgrounds, we often fall back on open-source tools like Airflow and our own coded solutions. For a single project, we will typically have three of us working on Airflow at a time: two data engineers and a system administrator. Our infrastructure model is hybrid, based both in the cloud and on-premises.
Business Consultant - Digitalization, Business Development, BPM, Technology & Innovation at a consultancy with 51-200 employees
Real User
Jan 15, 2021
We mainly used the solution in banking, finance, and insurance. We are looking for some opportunities in production companies, but this is only at the very early stages.
Senior Solutions Architect/ Software Architect at a comms service provider with 51-200 employees
Real User
Dec 22, 2020
We normally use the solution for creating a specific flow for data transformation. We have several pipelines that we use, and because they're pretty well-defined, we use it in conjunction with other tools that do the mediation portion. With Airflow, we do the processing of that data.
We are a technology, media, and entertainment-technology company. We are using Apache Airflow for architecting our media workflows. We are using it for two major workflows. We have had it set up for some time on our own cloud. Recently, we migrated the setup to AWS.
Apache Airflow is an open-source workflow management system (WMS) that is primarily used to programmatically author, orchestrate, schedule, and monitor data pipelines as well as workflows. The solution makes it possible for you to manage your data pipelines by authoring workflows as directed acyclic graphs (DAGs) of tasks. By using Apache Airflow, you can orchestrate data pipelines over object stores and data warehouses, run workflows that are not data-related, and can also create and manage...
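For readers new to the tool, a minimal DAG definition looks roughly like the sketch below; the task ids, commands, and schedule are illustrative only.

from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator

# Two tasks arranged as a small directed acyclic graph: extract must finish before load starts.
with DAG(
    dag_id="example_pipeline",
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    extract = BashOperator(task_id="extract", bash_command="echo 'extracting data'")
    load = BashOperator(task_id="load", bash_command="echo 'loading data'")

    extract >> load  # the edge that makes this a graph rather than two isolated tasks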
The main use case is orchestration. We use it to schedule our jobs.
We use Apache Airflow for the orchestration of data pipelines.
Our use cases are a bit complex, but they are primarily data extraction, transformation, and loading (ETL) tasks.
We use Apache Airflow for data orchestration.
We use Apache Airflow to send our data to a third-party system.
Apache Airflow is utilized for automating data engineering tasks. When creating a sequence of tasks, Airflow can assist in automating them.
We use this solution to monitor BD tasks.
The primary use case is the orchestration and automation of ELT/ETL data pipelines.
Apache Airflow is great in this respect, and there are scheduling options to make it fully automated based on the use case.
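Those scheduling options come down to the DAG's schedule_interval argument (schedule in newer releases), which accepts cron expressions or presets such as "@daily" and "@hourly". A small sketch with an arbitrary cron expression and DAG id:

from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator

with DAG(
    dag_id="nightly_refresh",
    start_date=datetime(2024, 1, 1),
    schedule_interval="30 2 * * *",  # cron syntax: every day at 02:30
    catchup=False,
) as dag:
    refresh = BashOperator(task_id="refresh", bash_command="echo 'refreshing warehouse'")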
Our primary use case is to integrate with SLAs.
The primary use case for this solution is to automate ETL processes for the data warehouse.