1. Apache HDFS is a distributed file system that lets a single Apache Hadoop cluster scale to hundreds (and even thousands) of nodes.

  2. Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.

  3. Google Cloud Dataflow is a fully managed cloud service and programming model for batch and streaming big data processing.

  4. A fully managed data warehouse for large-scale data analytics.

  5. Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.

  6. Hadoop-Related

  7. The Productivity Platform

  8. Open-source software for reliable, scalable, distributed computing

  9. Databricks provides a Unified Analytics Platform that accelerates innovation by unifying data science, engineering and business.

  10. MapR is a high-performance data management and IT management solution that integrates Apache Drill, Hadoop and Spark with real-time global event streaming, scalable enterprise storage, and database capabilities to control large appli…

  11. Clustering and highly scalable data distribution platform for Java

  12. Analytics Engine is a combined Apache Spark and Apache Hadoop service for creating analytics applications.