It serves as a versatile tool for data ingestion, handling tasks such as transforming data from one type or format to another. It makes data preparation and processing straightforward, supporting operations such as format conversion and type transformation.
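To give a sense of the kind of format conversion this covers, here is a minimal sketch of a conversion step that could run inside an Airflow task; the file paths are hypothetical, and it assumes pandas with a Parquet engine such as pyarrow is installed.

```python
import pandas as pd

def csv_to_parquet(input_path: str, output_path: str) -> None:
    """Convert one format to another: read a CSV file and rewrite it as Parquet."""
    df = pd.read_csv(input_path)             # ingest the raw file
    df.to_parquet(output_path, index=False)  # requires pyarrow or fastparquet

# Hypothetical paths, for illustration only.
csv_to_parquet("/data/raw/events.csv", "/data/prepared/events.parquet")
```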
We leverage Apache Airflow to orchestrate our data pipelines, primarily because of the number of data sources we manage. These sources vary in nature: some deliver streaming data, while others use protocols such as FTP or rely on landing areas. We use Airflow to orchestrate data ingestion, transformation, and preparation, ensuring the data is formatted appropriately for further processing. Typically, this involves normalization, enrichment, and structuring the data for consumption by Spark or similar platforms in our ecosystem, along the lines of the sketch below.
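A minimal sketch of how such a pipeline might be wired up with Airflow's TaskFlow API (Airflow 2.4+); the task names, paths, and the Spark hand-off stub are illustrative assumptions, not our actual pipeline.

```python
from datetime import datetime
from airflow.decorators import dag, task

@dag(schedule="@daily", start_date=datetime(2024, 1, 1), catchup=False)
def ingest_and_prepare():

    @task
    def ingest_from_ftp() -> str:
        # Pull the latest drop from an FTP landing area (stubbed here).
        return "/data/raw/latest.csv"

    @task
    def normalize_and_enrich(raw_path: str) -> str:
        # Normalize and enrich the raw records, write a cleaned file.
        return "/data/clean/latest.parquet"

    @task
    def hand_off_to_spark(clean_path: str) -> None:
        # In practice this might submit a Spark job, e.g. via a Spark operator.
        print(f"Submitting Spark job for {clean_path}")

    # Wire the tasks into a linear dependency chain.
    hand_off_to_spark(normalize_and_enrich(ingest_from_ftp()))

ingest_and_prepare()
```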
The scheduling and monitoring functionality strengthens our data processing workflows. While the interface could be more user-friendly, proficiency in scheduling and monitoring comes with practice.
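As an example of the scheduling side, here is a small sketch of the knobs a DAG exposes, such as a cron schedule and automatic retries; the DAG name and values are illustrative, and the syntax targets Airflow 2.4+.

```python
from datetime import datetime, timedelta
from airflow import DAG
from airflow.operators.empty import EmptyOperator

with DAG(
    dag_id="nightly_refresh",                  # hypothetical DAG name
    schedule="30 2 * * *",                     # cron expression: 02:30 every night
    start_date=datetime(2024, 1, 1),
    catchup=False,
    default_args={
        "retries": 2,                          # retry transient failures automatically
        "retry_delay": timedelta(minutes=10),  # back off between attempts
    },
) as dag:
    EmptyOperator(task_id="placeholder")       # stand-in for a real task
```

Failures and retries configured this way then surface in the Airflow UI, which is where the monitoring happens in practice.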
Apache Airflow's scalability accommodates our increasing data processing demands well. Occasional server problems do arise, particularly around scaling, but overall the product remains reliably stable.
It offers a straightforward way to orchestrate numerous data sources efficiently, thanks to its user-friendly interface. The basics are quick to learn, although mastering certain aspects takes some experimentation, practice, and training.