Software Alternatives, Accelerators & Startups

Top 9 Big Data in Data Integration

The best Big Data within the Data Integration category - based on our collection of reviews & verified products.

Google Cloud Dataflow Amazon EMR Apache Flink ibm.com Hadoop HDFS Amazon Redshift Snowflake Qubole Confluent Apache Kafka

Summary

The top products on this list are Google Cloud Dataflow, Amazon EMR, and Apache Flink. All products here are categorized as: Software and platforms for processing and analyzing large data sets. Software for combining data from different sources into a unified view. One of the criteria for ordering this list is the number of mentions that products have on reliable external sources. You can suggest additional sources through the form here.
  1. Google Cloud Dataflow is a fully-managed cloud service and programming model for batch and streaming big data processing.

    #Data Dashboard #Big Data #Data Management 14 social mentions

  2. Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.

    #Data Dashboard #Big Data #Big Data Infrastructure 10 social mentions

  3. NOTE: ibm.com Hadoop HDFS has been discontinued.
    The Apache HDFS is a distributed file system that makes it possible to scale a single Apache Hadoop cluster to hundreds (and even thousands) of nodes.

    #Data Dashboard #Big Data #Data Management

  4. Learn about Amazon Redshift cloud data warehouse.

    #Big Data #Databases #Data Management 29 social mentions

  5. Snowflake is the only data platform built for the cloud for all your data & all your users. Learn more about our purpose-built SQL cloud data warehouse.

    #Data Dashboard #Big Data #Big Data Infrastructure 4 social mentions

  6. 7
    Qubole delivers a self-service platform for big aata analytics built on Amazon, Microsoft and Google Clouds.
    Pricing:
    • Open Source

    #Data Dashboard #Big Data #Data Management

  7. Confluent offers a real-time data platform built around Apache Kafka.
    Pricing:
    • Open Source

    #Data Dashboard #Big Data #Data Management 1 social mentions

  8. Apache Kafka is an open-source message broker project developed by the Apache Software Foundation written in Scala.
    Pricing:
    • Open Source

    #Data Integration #Monitoring Tools #Stream Processing 146 social mentions

Related categories

Recently added products

If you want to make changes on any of the products, you can go to its page and click on the "Suggest Changes" link. Alternatively, if you are working on one of these products, it's best to verify it and make the changes directly through the management page. Thanks!