Software Alternatives, Accelerators & Startups

Google Cloud Dataflow

Google Cloud Dataflow is a fully-managed cloud service and programming model for batch and streaming big data processing.

Top 12 Open-Source Alternatives to Google Cloud Dataflow

Google Cloud Dataflow
Google BigQuery Databricks Qubole Confluent Apache Beam Apache Spark Apache Flink Apache Storm Presto DB Lenses.io

Summary

The top open-source alternatives to Google Cloud Dataflow are Google BigQuery, Databricks, and Qubole. One of the criteria for ordering this list is the number of mentions that products have on reliable external sources. You can suggest additional sources through the form here.
  1. A fully managed data warehouse for large-scale data analytics.
    Pricing:
    • Open Source

    #Data Management #Data Warehousing #Data Dashboard 35 social mentions

  2. Databricks provides a Unified Analytics Platform that accelerates innovation by unifying data science, engineering and business.‎What is Apache Spark?
    Pricing:
    • Open Source

    #Data Science #Data Dashboard #Database Tools 17 social mentions

  3. 3
    Qubole delivers a self-service platform for big aata analytics built on Amazon, Microsoft and Google Clouds.
    Pricing:
    • Open Source

    #Data Dashboard #Data Warehousing #Big Data

  4. Confluent offers a real-time data platform built around Apache Kafka.
    Pricing:
    • Open Source

    #Stream Processing #Data Management #Big Data 1 social mentions

  5. Apache Beam provides an advanced unified programming model to implement batch and streaming data processing jobs.
    Pricing:
    • Open Source

    #Big Data #Data Dashboard #Data Warehousing 14 social mentions

  6. Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
    Pricing:
    • Open Source

    #Databases #Big Data #Big Data Analytics 56 social mentions

  7. Apache Storm is a free and open source distributed realtime computation system.
    Pricing:
    • Open Source

    #Big Data #Data Management #Data Warehousing 11 social mentions

  8. Distributed SQL Query Engine for Big Data (by Facebook)
    Pricing:
    • Open Source

    #Database Tools #Data Dashboard #Big Data Analytics 6 social mentions

  9. Lenses delivers DataOps for any Apache Kafka. With Lenses, engineers are more productive when building streaming applications on Kafka.
    Pricing:
    • Open Source
    • Freemium
    • Free Trial
    • $49.0 / Monthly

    #Data Visualization #Big Data #Data Processing 6 social mentions

  10. 11
    Impala is a modern, open source, distributed SQL query engine for Apache Hadoop.
    Pricing:
    • Open Source

    #Big Data #Big Data Infrastructure #Databases

  11. A Streaming Database for Real-Time Applications
    Pricing:
    • Open Source

    #Database Tools #Databases #Relational Databases 65 social mentions

Suggest an alternative
If you think we've missed something, please suggest an alternative to Google Cloud Dataflow.
Please use the Feedback button if you think any of the listed products shouldn't be regarded as open-source.

Google Cloud Dataflow discussion

Log in or Post with