Software Alternatives, Accelerators & Startups

PySpark

PySpark Tutorial - Apache Spark is written in Scala programming language. To support Python with Spark, Apache Spark community released a tool, PySpark. Using PySpark, you can wor.

Top 8 Open-Source Alternatives to PySpark

PySpark
Dask Apache Spark Hadoop Apache Airflow Grist Apache Beam Apache Storm Google Cloud Datastore

Summary

The top open-source alternatives to PySpark are Dask, Apache Spark, and Hadoop. One of the criteria for ordering this list is the number of mentions that products have on reliable external sources. You can suggest additional sources through the form here.
  1. 1
    Dask natively scales Python Dask provides advanced parallelism for analytics, enabling performance at scale for the tools you love
    Pricing:
    • Open Source

    #Big Data #Databases #Workflow Automation 16 social mentions

  2. Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
    Pricing:
    • Open Source

    #Big Data #Databases #Big Data Infrastructure 72 social mentions

  3. 3
    Open-source software for reliable, scalable, distributed computing
    Pricing:
    • Open Source

    #Big Data #Databases #NoSQL Databases 26 social mentions

  4. Airflow is a platform to programmaticaly author, schedule and monitor data pipelines.
    Pricing:
    • Open Source

    #Automation #Workflow Automation #ETL 79 social mentions

  5. 5
    Grist makes it easy to transform spreadsheets into a custom database where data is truly actionable.
    Pricing:
    • Open Source

    #Open Source #Spreadsheets #Databases 9 social mentions

  6. Apache Beam provides an advanced unified programming modelย to implement batch and streaming data processing jobs.
    Pricing:
    • Open Source

    #Data Dashboard #Big Data #Big Data Tools 15 social mentions

  7. Apache Storm is a free and open source distributed realtime computation system.
    Pricing:
    • Open Source

    #Data Dashboard #Big Data #Data Management 11 social mentions

  8. Cloud Datastore is a NoSQL database for your web and mobile applications.
    Pricing:
    • Open Source

    #Databases #NoSQL Databases #Relational Databases 7 social mentions

Suggest an alternative
If you think we've missed something, please suggest an alternative to PySpark.
Please use the Feedback button if you think any of the listed products shouldn't be regarded as open-source.

PySpark discussion

Log in or Post with