Software Alternatives & Reviews

Comparison of Python pipeline packages: Airflow, Luigi, Gokart, Metaflow, Kedro, PipelineX

Apache Airflow Luigi Metaflow Kedro
  1. Airflow is a platform to programmaticaly author, schedule and monitor data pipelines.
    Pricing:
    • Open Source
    Not designed to pass data between dependent tasks without using a database. There is no good way to pass unstructured data (e.g. image, video, pickle, etc.) between dependent tasks in Airflow.

    #Workflows #Workflow Automation #Data Pipelines 65 social mentions

  2. 2
    Luigi is a Python module that helps you build complex pipelines of batch jobs.
    Luigi enables you to define your pipeline by child classes of Task with 3 class methods (requires, output, run) in Python code.

    #DevOps Tools #Workflow Automation #Workflows 9 social mentions

  3. 3
    G

    Gokart

    This product hasn't been added to SaaSHub yet

  4. Framework for real-life data science; build, improve, and operate end-to-end workflows.
    Pricing:
    • Open Source
    Metaflow enables you to define your pipeline as a child class of FlowSpec that includes class methods with step decorators in Python code.

    #Workflow Automation #DevOps Tools #Workflows 12 social mentions

  5. 5
    An open-source framework for data science code
    Pricing:
    • Open Source
    Kedro enables you to define pipelines using list of node functions with 3 arguments (func: task processing function, inputs: input data name (list or dict if multiple), outputs: output data name (list or dict if multiple)) in Python code (an independent Python module).

    #Data Science And Machine Learning #Data Pipelines #Python Web Framework

Discuss: Comparison of Python pipeline packages: Airflow, Luigi, Gokart, Metaflow, Kedro, PipelineX

Log in or Post with