Software Alternatives & Reviews

Apache Spark

Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.

Top 12 Open-Source Alternatives to Apache Spark

Apache Flink, Hadoop, Apache Storm, Apache Hive, Databricks, Apache Druid, Presto DB, Apache Beam, Redis, Google BigQuery

Summary

The top open-source alternatives to Apache Spark are Apache Flink, Hadoop, and Apache Storm. One of the criteria for ordering this list is the number of mentions that products have on reliable external sources. You can suggest additional sources through the form here.
  1. Hadoop is open-source software for reliable, scalable, distributed computing.
    Pricing:
    • Open Source

    #Databases #NoSQL Databases #Big Data 15 social mentions

  2. Apache Storm is a free and open source distributed realtime computation system.
    Pricing:
    • Open Source

    #Big Data #Data Management #Databases 11 social mentions

  3. Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage.
    Pricing:
    • Open Source

    #Databases #Big Data #Data Warehousing 8 social mentions

  4. Databricks provides a Unified Analytics Platform that accelerates innovation by unifying data science, engineering and business.
    Pricing:
    • Open Source

    #Data Science #Data Dashboard #Database Tools 17 social mentions

  5. Apache Druid is a fast column-oriented distributed data store.
    Pricing:
    • Open Source

    #Databases #Big Data #Data Analysis 9 social mentions

  6. Presto DB is a distributed SQL query engine for big data (by Facebook).
    Pricing:
    • Open Source

    #Database Tools #Data Dashboard #Big Data Analytics 6 social mentions

  7. Apache Beam provides an advanced unified programming model to implement batch and streaming data processing jobs.
    Pricing:
    • Open Source

    #Big Data #Data Dashboard #Data Warehousing 14 social mentions

  8. Redis is an open source in-memory data structure project implementing a distributed, in-memory key-value database with optional durability.
    Pricing:
    • Open Source

    #Key-Value Database #NoSQL Databases #Databases 183 social mentions

  9. Google BigQuery is a fully managed data warehouse for large-scale data analytics.
    Pricing:
    • Open Source

    #Data Management #Data Warehousing #Data Dashboard 35 social mentions

  10. Confluent offers a real-time data platform built around Apache Kafka.
    Pricing:
    • Open Source

    #Stream Processing #Big Data #Data Management 1 social mention

  11. ClickHouse is an open-source column-oriented database management system that allows generating analytical data reports in real time.
    Pricing:
    • Open Source

    #Databases #Relational Databases #Data Warehousing 43 social mentions

Suggest an alternative
If you think we've missed something, please suggest an alternative to Apache Spark.
Please use the Feedback button if you think any of the listed products shouldn't be regarded as open-source.
