Software Alternatives, Accelerators & Startups

Apache Spark

Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.

Top 12 Open-Source Alternatives to Apache Spark

Apache Spark
Apache Flink Hadoop Apache Hive Apache Storm ClickHouse PostgreSQL Apache Druid Google BigQuery Redis Apache Beam

Summary

The top open-source alternatives to Apache Spark are Apache Flink, Hadoop, and Apache Hive. One of the criteria for ordering this list is the number of mentions that products have on reliable external sources. You can suggest additional sources through the form here.
  1. 2
    Open-source software for reliable, scalable, distributed computing
    Pricing:
    • Open Source

    #Big Data #Databases #NoSQL Databases 23 social mentions

  2. Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage.
    Pricing:
    • Open Source

    #Big Data #Databases #Relational Databases 8 social mentions

  3. Apache Storm is a free and open source distributed realtime computation system.
    Pricing:
    • Open Source

    #Data Dashboard #Big Data #Stream Processing 11 social mentions

  4. ClickHouse is an open-source column-oriented database management system that allows generating analytical data reports in real time.
    Pricing:
    • Open Source

    #Databases #Relational Databases #Data Warehousing 55 social mentions

  5. PostgreSQL is a powerful, open source object-relational database system.
    Pricing:
    • Open Source

    #Databases #NoSQL Databases #Relational Databases 16 social mentions

  6. Fast column-oriented distributed data store
    Pricing:
    • Open Source

    #Big Data #Databases #Data Analysis 10 social mentions

  7. A fully managed data warehouse for large-scale data analytics.
    Pricing:
    • Open Source

    #Data Dashboard #Big Data #Data Management 42 social mentions

  8. 9
    Redis is an open source in-memory data structure project implementing a distributed, in-memory key-value database with optional durability.
    Pricing:
    • Open Source

    #Databases #Graph Databases #NoSQL Databases 216 social mentions

  9. Apache Beam provides an advanced unified programming model to implement batch and streaming data processing jobs.
    Pricing:
    • Open Source

    #Data Dashboard #Big Data #Big Data Tools 15 social mentions

  10. Distributed SQL Query Engine for Big Data (by Facebook)
    Pricing:
    • Open Source

    #Data Dashboard #Big Data #Database Tools 10 social mentions

  11. Greenplum Database is an open source parallel data warehousing platform.
    Pricing:
    • Open Source

    #Big Data #Databases #Relational Databases 4 social mentions

Suggest an alternative
If you think we've missed something, please suggest an alternative to Apache Spark.
Please use the Feedback button if you think any of the listed products shouldn't be regarded as open-source.

Apache Spark discussion

Log in or Post with