Software Alternatives, Accelerators & Startups

Search

Top 20 open-source products relevant to Big Data

Showing 20 of 40+ results. Refine your search or use the filters to narrow down the products.

  1. A fully managed data warehouse for large-scale data analytics.

    Open Source

    /google-bigquery-alternatives
  2. Meet Neo4j: The graph database platform powering today's mission-critical enterprise applications, including artificial intelligence, fraud detection and recommendations.

    Open Source

    /neo4j-alternatives
  3. Qubole delivers a self-service platform for big aata analytics built on Amazon, Microsoft and Google Clouds.

    Open Source

    /qubole-alternatives
  4. Distributed SQL Query Engine for Big Data (by Facebook).

    Open Source

    /presto-db-alternatives
  5. Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.

    Open Source

    /apache-spark-alternatives
  6. Flink is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations.

    Open Source

    /apache-flink-alternatives
  7. Scalable datastore for metrics, events, and real-time analytics.

    Open Source

    /influxdata-alternatives
  8. Confluent offers a real-time data platform built around Apache Kafka.

    Open Source

    /confluent-alternatives
  9. Open-source software for reliable, scalable, distributed computing.

    Open Source

    /hadoop-alternatives
  10. Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage.

    Open Source

    /apache-hive-alternatives
  11. Apache Storm is a free and open source distributed realtime computation system.

    Open Source

    /apache-storm-alternatives
  12. A fast, distributed graph database with ACID transactions.

    Open Source

    /dgraph-alternatives
  13. A Streaming Database for Real-Time Applications.

    Open Source

    /materialize-alternatives
  14. Fast column-oriented distributed data store.

    Open Source

    /apache-druid-alternatives
  15. DuckDB is an in-process SQL OLAP database management system.

    Open Source

    /duckdb-alternatives
  16. Apache Arrow is a cross-language development platform for in-memory data.

    Open Source

    /apache-arrow-alternatives
  17. Apache Beam provides an advanced unified programming modelย to implement batch and streaming data processing jobs.

    Open Source

    /apache-beam-alternatives
  18. NetworkX is a Python language software package for the creation, manipulation, and study of the...

    Open Source

    /networkx-alternatives
  19. StarRocks offers the next generation of real-time SQL engines for enterprise-scale analytics. Learn how we make it easy to deliver real-time analytics.

    Open Source

    /starrocks-alternatives
  20. Open-source graph database.

    Open Source

    /cayley-alternatives

Products list improvements

Thank for using SaaSHub. We hope you find it useful. A lot of the information here is community driven, and while we will try to keep it up to date, we appreciate any help we can get. For example, if you think the details about a product are outdated, you can go to its page on SaaSHub and update it by using the "Edit" button. You don't need an account for that. Thank you for your help!