Based on our records, Apache Parquet appears to be more popular than Azure Blob Storage. It has been mentioned 25 times since March 2021. We track product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
There are also other object storage services that provide more comprehensive CAS support such as ABS, GCS, MinIO, R2, and Tigris. - Source: dev.to / 4 months ago
Responds to changes in Azure Blob Storage (e.g., file uploads). - Source: dev.to / 9 months ago
Azure Blob Storage is a scalable and highly available object storage service provided by Microsoft Azure. They offer various storage tiers, so you can optimize cost and performance based on your requirements. They also provide features like lifecycle management, versioning, and data encryption. - Source: dev.to / about 2 years ago
An object storage system (e.g. Amazon S3, Azure Blob Storage, Google Cloud Platform Cloud Storage, etc.) makes it easy and simple to save large amounts of historical data and retrieve it for future use. - Source: dev.to / over 2 years ago
I want to share my experience unzipping large files stored in Azure Blob Storage using Azure Functions with Node.js. - Source: dev.to / almost 3 years ago
If there was a way to package and compress the Excel spreadsheet in a web-friendly format, then there's nothing stopping us from loading the entire dataset in the browser! Sure enough, the Parquet file format was specifically designed for efficient portability. - Source: dev.to / about 1 month ago
Iceberg decouples storage from compute. That means your data isn't trapped inside one proprietary system. Instead, it lives in open file formats (like Apache Parquet) and is managed by an open, vendor-neutral metadata layer (Apache Iceberg). - Source: dev.to / 6 months ago
Data prep kit github repository: https://github.com/data-prep-kit/data-prep-kit?tab=readme-ov-file Quick start guide: https://github.com/data-prep-kit/data-prep-kit/blob/dev/doc/quick-start/contribute-your-own-transform.md Provided samples and examples: https://github.com/data-prep-kit/data-prep-kit/tree/dev/examples Parquet: https://parquet.apache.org/. - Source: dev.to / 6 months ago
Deliver nice ready-to-use data as duckdb, parquet and csv. - Source: dev.to / 6 months ago
Push the dataset to hugging face in parquet format. - Source: dev.to / 11 months ago
Google Cloud Storage - Google Cloud Storage offers developers and IT organizations durable and highly available object storage.
Apache Arrow - Apache Arrow is a cross-language development platform for in-memory data.
Minio - Minio is an open-source minimal cloud storage server.
Apache Spark - Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
Amazon S3 - Amazon S3 is an object storage service where users can store data from their business on a safe, cloud-based platform. Amazon S3 operates in 54 availability zones within 18 geographic regions and 1 local region.
DuckDB - DuckDB is an in-process SQL OLAP database management system.