Software Alternatives & Reviews

Twitter Data Pipeline with Apache Airflow + MinIO (S3 compatible Object Storage)

Pandas Minio Apache Airflow
  1. 1
    Pandas is an open source library providing high-performance, easy-to-use data structures and data analysis tools for the Python.
    Pricing:
    • Open Source
    Below is the python Task that transforms the tweets list into a Pandas dataframe, then dumps it in our MinIO Object Storage as a CSV file:.

    #Data Science And Machine Learning #Data Science Tools #Python Tools 198 social mentions

  2. 2
    Minio is an open-source minimal cloud storage server.
    To learn more about it, I built a Data Pipeline that uses Apache Airflow to pull Elon Musk tweets using the Twitter API and store the result in a CSV stored in a MinIO (OSS alternative to AWS s3) Object Storage bucket.

    #Cloud Storage #Cloud Computing #Object Storage 154 social mentions

  3. Airflow is a platform to programmaticaly author, schedule and monitor data pipelines.
    Pricing:
    • Open Source
    To learn more about it, I built a Data Pipeline that uses Apache Airflow to pull Elon Musk tweets using the Twitter API and store the result in a CSV stored in a MinIO (OSS alternative to AWS s3) Object Storage bucket.

    #Workflows #Workflow Automation #Data Pipelines 65 social mentions

Discuss: Twitter Data Pipeline with Apache Airflow + MinIO (S3 compatible Object Storage)

Log in or Post with