Software Alternatives & Reviews

Apache Airflow

Airflow is a platform to programmatically author, schedule, and monitor data pipelines.

Apache Airflow Reviews and details

Videos

Airflow Tutorial for Beginners - Full Course in 2 Hours 2022

Social recommendations and mentions

We have tracked the following product recommendations or mentions on various public social media platforms and blogs. They can help you see what people think about Apache Airflow and what they use it for.
  • The 2024 Web Hosting Report
    For the third, examples here might be analytics plugins in specialized databases like Clickhouse, data-transformations in places like your ETL pipeline using Airflow or Fivetran, or special integrations in your authentication workflow with Auth0 hooks and rules. - Source: dev.to / 3 months ago
  • Best ETL Tools And Why To Choose
    Apache Airflow is an open-source platform to programmatically author, schedule, and monitor workflows. The platform features a web-based user interface and a command-line interface for managing and triggering workflows. Source: 6 months ago
  • Simplifying Data Transformation in Redshift: An Approach with DBT and Airflow
    Airflow is the most widely used and well-known tool for orchestrating data workflows. It allows for efficient pipeline construction, scheduling, and monitoring. - Source: dev.to / 6 months ago
  • Share Your favorite python related software!
    AIRFLOW This is more of a library in my opinion, but Airflow has become an essential tool for scheduling in my work. All our ML training pipelines are ordered and scheduled with Airflow and it works seamlessly. The dashboard provided is also fantastic! Source: 7 months ago
  • Ask HN: What is the correct way to deal with pipelines?
    I agree there are many options in this space. Two others to consider: - https://airflow.apache.org/ - https://github.com/spotify/luigi There are also many Kubernetes based options out there. For the specific use case you specified, you might even consider a plain old Makefile and incrond if you expect these all to run on a single host and be triggered by a new file... - Source: Hacker News / 8 months ago
  • PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows
    Folks who have used Python-based orchestration tools such as Apache Airflow, Luigi and Mage will be familiar with the concepts and the API of PyJaws. Source: 12 months ago
  • My experience with optimizing machine learning workflow
    There are a range of solutions available to help make this process easier. Some of these options include automation tools such as Apache Airflow and Azure Data Factory, specialized libraries that focus on fine-tuning deep learning models like FinetunerPlus, or machine learning platforms that provide end-to-end solutions like Amazon SageMaker and Google Cloud ML Engine. Source: about 1 year ago
  • Python task scheduler with a web UI
    Looks interesting as a light-weight alternative to https://www.prefect.io/ (which itself is a lighter-weight / more modern alternative to https://airflow.apache.org/ ). Source: about 1 year ago
  • .NET Modern Task Scheduler
    A few years ago, I opened a GitHub issue with Microsoft telling them that I think the .NET ecosystem needs its own equivalent of Apache Airflow or Prefect. Fast forward 'til now, and I still don't think we have anything close to these frameworks. Source: about 1 year ago
  • How do you backup running systems?
    If you have the spare capacity Apache Airflow is great for this. Source: about 1 year ago
  • How do you manage scheduled tasks?
    It's a bit overkill but I use Airflow with local executor. Source: over 1 year ago
  • Twitter Data Pipeline with Apache Airflow + MinIO (S3 compatible Object Storage)
    To learn more about it, I built a Data Pipeline that uses Apache Airflow to pull Elon Musk tweets using the Twitter API and store the result in a CSV stored in a MinIO (OSS alternative to AWS s3) Object Storage bucket. - Source: dev.to / over 1 year ago
  • What's the best and easiest to use GUI-based CI tool? No Jenkins suggestions, please.
    Airflow, that's it https://airflow.apache.org/. Source: over 1 year ago
  • First data lake pipeline advice - multitenancy
    If you are fixed on developing a solution in house, you may have options that don't require many additional tools. Aurora Postgres already supports exporting data to S3. So use an orchestration tool like AWS ECS Scheduled Tasks, Airflow, Prefect, etc to run a script (probably Python). That script can ask for all the distinct tenant ids "SELECT distinct tenant_id FROM...". Then iterate through them and run a query... Source: over 1 year ago
  • ETL tool
    Airflow is really popular, started at Airbnb. Pros: huge community, super mature. Cons: generic workflow orchestration, not the best for handling only data, hard to scale and maintain. Source: over 1 year ago
  • is there a kind of software for describing and searching data about batch jobs (their structure, relationship between them, constraints, date/time details etc)?
    Seems like you want documentation. Look into plantuml or something like that? For reference tools like airflow provide this stuff too. Source: over 1 year ago
  • How to do distributed cronjobs with worker queues?
    Airflow might also be a good option for you. Essentially DAGs of cronjobs. We like it a lot. Source: over 1 year ago
  • Airflow :: Deploy Apache Airflow on Rancher K3s
    $ helm upgrade --install airflow apache-airflow/airflow --namespace airflow --create-namespace
    Release "airflow" does not exist. Installing it now.
    NAME: airflow
    LAST DEPLOYED: Sun Nov 6 02:06:55 2022
    NAMESPACE: airflow
    STATUS: deployed
    REVISION: 1
    TEST SUITE: None
    NOTES:
    Thank you for installing Apache Airflow 2.4.1!
    Your release is named airflow.
    You can now access your dashboard(s) by executing the following...
    - Source: dev.to / over 1 year ago
  • Need help finding container automation solution (logic flows)
    https://airflow.apache.org/ might be worth looking into. Source: over 1 year ago
  • Duct Size vs. Airflow (2012)
    I gotta admit, my first thought was "Duct Size" is a weird name for a distributed work-flow tool[1]. [1] https://airflow.apache.org/. - Source: Hacker News / over 1 year ago
  • Built and automated a complete end-to-end ELT pipeline using AWS, Airflow, dbt, Terraform, Metabase and more as a beginner project!
    Infrastructure provisioning through Terraform, containerized through Docker and orchestrated through Airflow. Created dashboard through Metabase. Source: over 1 year ago
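
One of the mentions above ("First data lake pipeline advice - multitenancy") describes a script that selects the distinct tenant ids and then runs one query per tenant. A minimal sketch of that loop might look like the following. It is an illustration only: it uses Python's built-in `sqlite3` as a stand-in for Aurora Postgres, returns CSV text instead of writing to S3, and the `events` table, its `payload` column, and the function names are hypothetical.

```python
import csv
import io
import sqlite3


def export_per_tenant(conn, table="events"):
    """Return {tenant_id: csv_text}, one export per distinct tenant.

    `table` is a hypothetical, trusted table name (not user input).
    """
    cur = conn.cursor()
    # Step 1: ask for all the distinct tenant ids.
    cur.execute(f"SELECT DISTINCT tenant_id FROM {table} ORDER BY tenant_id")
    tenant_ids = [row[0] for row in cur.fetchall()]

    exports = {}
    for tenant_id in tenant_ids:
        # Step 2: run one parameterized query per tenant.
        cur.execute(
            f"SELECT tenant_id, payload FROM {table} "
            "WHERE tenant_id = ? ORDER BY payload",
            (tenant_id,),
        )
        # Step 3: serialize the rows; a real script would upload this instead.
        buf = io.StringIO()
        writer = csv.writer(buf, lineterminator="\n")
        writer.writerow(["tenant_id", "payload"])
        writer.writerows(cur.fetchall())
        exports[tenant_id] = buf.getvalue()
    return exports


def demo():
    """Build a tiny in-memory table and export it per tenant."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE events (tenant_id TEXT, payload TEXT)")
    conn.executemany(
        "INSERT INTO events VALUES (?, ?)",
        [("a", "x"), ("b", "y"), ("a", "z")],
    )
    return export_per_tenant(conn)
```

An orchestrator such as Airflow would then only be responsible for running a script like this on a schedule; the per-tenant logic stays in plain Python.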

External sources with reviews and comparisons of Apache Airflow

5 Airflow Alternatives for Data Orchestration
While Apache Airflow continues to be a popular tool for data orchestration, the alternatives presented here offer a range of features and benefits that may better suit certain projects or team preferences. Whether you prioritize simplicity, code-centric design, or the integration of machine learning workflows, there is likely an alternative that meets your needs. By exploring these options, teams can find the...
Top 8 Apache Airflow Alternatives in 2024
Apache Airflow is a workflow streamlining solution aiming at accelerating routine procedures. This article provides a detailed description of Apache Airflow as one of the most popular automation solutions. It also presents and compares alternatives to Airflow, their characteristic features, and recommended application areas. Based on that, each business could decide which workflow automation tool could benefit them.
10 Best Airflow Alternatives for 2024
In a nutshell, you gained a basic understanding of Apache Airflow and its powerful features. On the other hand, you understood some of the limitations and disadvantages of Apache Airflow. Hence, this article helped you explore the best Apache Airflow Alternatives available in the market. So, you can try hands-on on these Airflow Alternatives and select the best according to your use case.
A List of The 16 Best ETL Tools And Why To Choose Them
Apache Airflow is an open-source platform to programmatically author, schedule, and monitor workflows. The platform features a web-based user interface and a command-line interface for managing and triggering workflows.
15 Best ETL Tools in 2022 (A Complete Updated List)
Apache Airflow programmatically creates, schedules and monitors workflows. It can also modify the scheduler to run the jobs as and when required.
Python & ETL 2020: A List and Comparison of the Top Python ETL Tools
When does Apache Airflow make sense? If you're performing long ETL jobs or your ETL has multiple steps, Airflow will let you restart from any point during the ETL process. That being said, Apache Airflow IS NOT a library, so it has to be deployed and may make less sense on small ETL jobs.
Comparison of Python pipeline packages: Airflow, Luigi, Gokart, Metaflow, Kedro, PipelineX
Not designed to pass data between dependent tasks without using a database. There is no good way to pass unstructured data (e.g. image, video, pickle, etc.) between dependent tasks in Airflow.
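
Several of the excerpts above describe Airflow as running multi-step jobs in dependency order and letting you restart from any point. The idea can be sketched with the standard library alone. This is not Airflow's API (Airflow tracks task state in its own metadata database); `run_pipeline`, `already_done`, and the task names are hypothetical and exist only for illustration.

```python
from graphlib import TopologicalSorter


def run_pipeline(dependencies, tasks, already_done=()):
    """Run tasks in dependency order, skipping those in already_done.

    dependencies maps each task name to the set of tasks it depends on.
    tasks maps each task name to a zero-argument callable.
    Returns the list of task names executed in this run.
    """
    done = set(already_done)
    executed = []
    for name in TopologicalSorter(dependencies).static_order():
        if name in done:
            continue  # finished in an earlier run; do not repeat it
        tasks[name]()
        executed.append(name)
    return executed


# Usage: a three-step ETL where "extract" already succeeded earlier.
log = []
deps = {"transform": {"extract"}, "load": {"transform"}}
steps = {n: (lambda n=n: log.append(n)) for n in ("extract", "transform", "load")}
resumed = run_pipeline(deps, steps, already_done={"extract"})
```

In this sketch a resumed run only executes `transform` and `load`, which mirrors the "restart from any point" behavior the excerpt attributes to Airflow.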

Apache Airflow discussion

This is an informative page about Apache Airflow. You can review and discuss the product here. The primary details have not been verified within the last quarter, and they might be outdated. If you think we are missing something, please use the means on this page to comment or suggest changes. All reviews and comments are highly encouraged and appreciated as they help everyone in the community to make an informed choice. Please always be kind and objective when evaluating a product and sharing your opinion.