Based on our records, Apache Kafka appears to be more popular than Apache Parquet. It has been mentioned 146 times since March 2021. We track product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
If there was a way to package and compress the Excel spreadsheet in a web-friendly format, then there's nothing stopping us from loading the entire dataset in the browser! Sure enough, the Parquet file format was specifically designed for efficient portability. - Source: dev.to / about 1 month ago
Iceberg decouples storage from compute. That means your data isn't trapped inside one proprietary system. Instead, it lives in open file formats (like Apache Parquet) and is managed by an open, vendor-neutral metadata layer (Apache Iceberg). - Source: dev.to / 6 months ago
Data prep kit github repository: https://github.com/data-prep-kit/data-prep-kit?tab=readme-ov-file Quick start guide: https://github.com/data-prep-kit/data-prep-kit/blob/dev/doc/quick-start/contribute-your-own-transform.md Provided samples and examples: https://github.com/data-prep-kit/data-prep-kit/tree/dev/examples Parquet: https://parquet.apache.org/. - Source: dev.to / 6 months ago
Deliver nice ready-to-use data as duckdb, parquet and csv. - Source: dev.to / 6 months ago
Push the dataset to hugging face in parquet format. - Source: dev.to / 11 months ago
Dive deeper into your PHP framework of choice by mastering its routing, middleware, and ORM capabilities. As your expertise grows, consider exploring advanced approaches like microservices for independent deployment or GraphQL for more flexible data querying. Event-driven architectures using tools like RabbitMQ or Kafka can also improve scalability and responsiveness. - Source: dev.to / about 1 month ago
If you've ever worked as an enterprise developer in any moderately complex company, you've likely encountered distributed systems of the kind I want to talk about in this post: two or more systems communicating together via a message queue (MQ), such as RabbitMQ or Apache Kafka. Distributed, message-based systems are ubiquitous in today's programming landscape, especially due to the (now hopefully at least somewhat... - Source: dev.to / about 2 months ago
Kafka: Our trusty message bus. Events land here first. - Source: dev.to / 5 months ago
For those interested in a deeper dive into Apache Kafka's multifaceted world, further details can be found on the official Kafka website and the Apache Kafka GitHub repository. Additionally, exploring innovative funding models via resources like tokenizing open source licenses provides insight into the future of open source software sustainability. - Source: dev.to / 5 months ago
Ingest real-time data from Kafka, Pulsar, or CDC sources like Postgres and MySQL, with built-in support for Debezium. - Source: dev.to / 5 months ago
Apache Arrow - Apache Arrow is a cross-language development platform for in-memory data.
RabbitMQ - RabbitMQ is an open source message broker software.
Apache Spark - Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
Histats - Start tracking your visitors in 1 minute!
DuckDB - DuckDB is an in-process SQL OLAP database management system.
StatCounter - StatCounter is a simple but powerful real-time web analytics service that helps you track, analyse and understand your visitors so you can make good decisions to become more successful online.