Software Alternatives & Reviews

Top 9 Big Data in Big Data Infrastructure

The best Big Data within the Big Data Infrastructure category - based on our collection of reviews & verified products.

Amazon EMR Apache Spark Snowflake Hadoop Impala MapR Converged Data Platform BlueData Apache Flume Apache ORC

Summary

The top products on this list are Amazon EMR, Apache Spark, and Snowflake. All products here are categorized as: Software and platforms for processing and analyzing large data sets. Big Data Infrastructure. One of the criteria for ordering this list is the number of mentions that products have on reliable external sources. You can suggest additional sources through the form here.
  1. Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.

    #Big Data #Big Data Tools #Big Data Infrastructure 10 social mentions

  2. Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
    Pricing:
    • Open Source

    #Databases #Big Data #Big Data Analytics 56 social mentions

  3. Snowflake is the only data platform built for the cloud for all your data & all your users. Learn more about our purpose-built SQL cloud data warehouse.

    #Data Warehousing #Cloud Data #Data Dashboard 4 social mentions

  4. 4
    Open-source software for reliable, scalable, distributed computing
    Pricing:
    • Open Source

    #Databases #NoSQL Databases #Big Data 15 social mentions

  5. 5
    Impala is a modern, open source, distributed SQL query engine for Apache Hadoop.
    Pricing:
    • Open Source

    #Big Data #Big Data Infrastructure #Databases

  6. An enterprise-grade distributed data platform that you can trust to reliably store and process big and fast data.

    #Big Data #Data Dashboard #Development

  7. BlueData's software platform makes it easier, faster and more cost-effective for organizations to deploy Big Data infrastructure on-premises.

    #Big Data #Big Data Infrastructure #Data Dashboard

  8. Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data

    #Big Data #Log Management #Databases 1 social mentions

  9. Apache ORC is a columnar storage for Hadoop workloads.
    Pricing:
    • Open Source

    #Big Data #Databases #Stream Processing 3 social mentions

Related categories

Recently added products

If you want to make changes on any of the products, you can go to its page and click on the "Suggest Changes" link. Alternatively, if you are working on one of these products, it's best to verify it and make the changes directly through the management page. Thanks!