The Apache HDFS is a distributed file system that makes it possible to scale a single Apache Hadoop cluster to hundreds (and even thousands) of nodes.
-
A fully managed data warehouse for large-scale data analytics.
-
Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.
-
Google Cloud Dataflow is a fully-managed cloud service and programming model for batch and streaming big data processing.