Apache Spark
Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
- Open Source
Apache Spark Alternatives
The best Apache Spark alternatives based on verified products, community votes, reviews and other factors.
Latest update:
-
/apache-flink-alternatives
Flink is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations.
-
/airflow-alternatives
Airflow is a platform to programmaticaly author, schedule and monitor data pipelines.
-
Visit website
Seamless project management and collaboration for your team.
-
/hadoop-alternatives
Open-source software for reliable, scalable, distributed computing
-
/apache-kafka-alternatives
Apache Kafka is an open-source message broker project developed by the Apache Software Foundation written in Scala.
-
/apache-storm-alternatives
Apache Storm is a free and open source distributed realtime computation system.
-
/google-cloud-dataflow-alternatives
Google Cloud Dataflow is a fully-managed cloud service and programming model for batch and streaming big data processing.
-
/databricks-alternatives
Databricks provides a Unified Analytics Platform that accelerates innovation by unifying data science, engineering and business.What is Apache Spark?
-
/apache-hive-alternatives
Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage.
-
/amazon-emr-alternatives
Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.
-
/amazon-kinesis-alternatives
Amazon Kinesis services make it easy to work with real-time streaming data in the AWS cloud.
-
/apache-druid-alternatives
Fast column-oriented distributed data store
-
/presto-db-alternatives
Distributed SQL Query Engine for Big Data (by Facebook)
-
/spring-batch-alternatives
Level up your Java code and explore what Spring can do for you.