Based on our record, Metaflow should be more popular than Azkaban. It has been mentiond 14 times since March 2021. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
Not sure if https://azkaban.github.io/ would fit your use case. Source: about 3 years ago
I used this once, was pretty nice: https://azkaban.github.io/. Source: about 3 years ago
Apache Azkaban is a batch workflow job scheduler to help developers run Hadoop jobs. The open-sourced platform “resolves ordering through job dependencies” and offers an intuitive web interface to help users maintain and track workflows. Source: about 3 years ago
Metaflow is an open source framework developed at Netflix for building and managing ML, AI, and data science projects. This tool addresses the issue of deploying large data science applications in production by allowing developers to build workflows using their Python API, explore with notebooks, test, and quickly scale out to the cloud. ML experiments and workflows can also be tracked and stored on the platform. - Source: dev.to / 7 months ago
As a data scientist/ML practitioner, how would you feel if you can independently iterate on your data science projects without ever worrying about operational overheads like deployment or containerization? Let’s find out by walking you through a sample project that helps you do so! We’ll combine Python, AWS, Metaflow and BentoML into a template/scaffolding project with sample code to train, serve, and deploy ML... - Source: dev.to / 10 months ago
I would recommend the following: - https://www.mage.ai/ - https://dagster.io/ - https://www.prefect.io/ - https://metaflow.org/ - https://zenml.io/home. Source: about 2 years ago
1) I've been looking into [Metaflow](https://metaflow.org/), which connects nicely to AWS, does a lot of heavy lifting for you, including scheduling. Source: about 2 years ago
Even for people who don't have an ML background there's now a lot of very fully-featured model deployment environments that allow self-hosting (kubeflow has a good self-hosting option, as do mlflow and metaflow), handle most of the complicated stuff involved in just deploying an individual model, and work pretty well off the shelf. Source: over 2 years ago
Apache Airflow - Airflow is a platform to programmaticaly author, schedule and monitor data pipelines.
RunDeck - RunDeck is an open source automation service with a web console, command line tools and a WebAPI.
Luigi - Luigi is a Python module that helps you build complex pipelines of batch jobs.
DepHell - :package: :fire: Python project management. Manage packages: convert between formats, lock, install, resolve, isolate, test, build graph, show outdated, audit. Manage venvs, build package, bump ver...
Jenkins - Jenkins is an open-source continuous integration server with 300+ plugins to support all kinds of software development
StackStorm - StackStorm is a powerful open-source automation platform that wires together all of your apps...