Software Alternatives & Reviews
Register   |   login

Apache Spark VS Hadoop

Compare Apache Spark VS Hadoop and see what are their differences


Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.

Open-source software for reliable, scalable, distributed computing
Apache Spark Landing Page
Apache Spark Landing Page
Hadoop Landing Page
Hadoop Landing Page

Apache Spark details

Categories
Databases Big Data Big Data Analytics Big Data Infrastructure
Website spark.apache.org  
Pricing URL-
Details $-
Platforms
-
Release Date-

Hadoop details

Categories
Databases Big Data Relational Databases
Website hadoop.apache.org  
Pricing URL-
Details $-
Platforms
-
Release Date-

Apache Spark videos

Weekly Apache Spark live Code Review -- look at StringIndexer multi-col (Scala) & Python testing

More videos:

  • - What's New in Apache Spark 3.0.0
  • - Apache Spark for Data Engineering and Analysis - Overview

Hadoop videos

No Hadoop videos yet. You could help us improve this page by suggesting one.

+ Add video

Category Popularity beta

0-100% (relative to Apache Spark and Hadoop)
69
69%
31%
31
76
76%
24%
24
76
76%
24%
24
0
0%
100%
100

What are some alternatives?

When comparing Apache Spark and Hadoop, you can also consider the following products

Hive - The Productivity Platform

Hortonworks - Hadoop-Related

Apache Flink - Flink is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations.

MongoDB - MongoDB (from "humongous") is a scalable, high-performance, open source NoSQL database.

Apache Druid - Fast column-oriented distributed data store

MySQL - The world's most popular open source database

User comments

Share your experience with using Apache Spark and Hadoop. For example, how are they different and which one is better?

Add Comment