Software Alternatives & Reviews

Demystifying Apache Arrow

Apache Parquet Apache Arrow
  1. Apache Parquet is a columnar storage format available to any project in the Hadoop ecosystem.
    Pricing:
    • Open Source
    Apache Parquet (Parquet for short), which nowadays is an industry standard to store columnar data on disk. It compress the data with high efficiency and provides fast read and write speeds. As written in the Arrow documentation, "Arrow is an ideal in-memory transport layer for data that is being read or written with Parquet files".

    #Databases #Big Data #Relational Databases 19 social mentions

  2. Apache Arrow is a cross-language development platform for in-memory data.
    Pricing:
    • Open Source
    Apache Arrow (Arrow for short) is an open source project that defines itself as "a language-independent columnar memory format" (more on that later). It is part of the Apache Software Foundation, and as such is governed by a community of several stakeholders. It has implementations in several languages (C++ and also Rust, Julia, Go, and even JavaScript) and bindings for Python, R and others that wrap the C++ implementation.

    #Databases #NoSQL Databases #Relational Databases 33 social mentions

Discuss: Demystifying Apache Arrow

Log in or Post with