Based on our records, Apache Parquet seems to be a lot more popular than Infinispan. While we know about 25 links to Apache Parquet, we've tracked only 1 mention of Infinispan. We track product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
If there was a way to package and compress the Excel spreadsheet in a web-friendly format, then there's nothing stopping us from loading the entire dataset in the browser! Sure enough, the Parquet file format was specifically designed for efficient portability. - Source: dev.to / about 1 month ago
Iceberg decouples storage from compute. That means your data isn't trapped inside one proprietary system. Instead, it lives in open file formats (like Apache Parquet) and is managed by an open, vendor-neutral metadata layer (Apache Iceberg). - Source: dev.to / 6 months ago
Data prep kit github repository: https://github.com/data-prep-kit/data-prep-kit?tab=readme-ov-file Quick start guide: https://github.com/data-prep-kit/data-prep-kit/blob/dev/doc/quick-start/contribute-your-own-transform.md Provided samples and examples: https://github.com/data-prep-kit/data-prep-kit/tree/dev/examples Parquet: https://parquet.apache.org/. - Source: dev.to / 6 months ago
Deliver nice ready-to-use data as duckdb, parquet and csv. - Source: dev.to / 6 months ago
Push the dataset to hugging face in parquet format. - Source: dev.to / 11 months ago
You can already download it for free you know. Source: over 2 years ago
Apache Arrow - Apache Arrow is a cross-language development platform for in-memory data.
Redis - Redis is an open source in-memory data structure project implementing a distributed, in-memory key-value database with optional durability.
Apache Spark - Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
memcached - High-performance, distributed memory object caching system
DuckDB - DuckDB is an in-process SQL OLAP database management system
Couchbase - Document-Oriented NoSQL Database