Based on our records, Apache Parquet appears to be more popular than KeyDB. It has been mentioned 24 times since March 2021. We track product recommendations and mentions on various public social media platforms and blogs. These can help you identify which product is more popular and what people think of it.
Iceberg decouples storage from compute. That means your data isn’t trapped inside one proprietary system. Instead, it lives in open file formats (like Apache Parquet) and is managed by an open, vendor-neutral metadata layer (Apache Iceberg). - Source: dev.to / 22 days ago
Data prep kit github repository: https://github.com/data-prep-kit/data-prep-kit?tab=readme-ov-file Quick start guide: https://github.com/data-prep-kit/data-prep-kit/blob/dev/doc/quick-start/contribute-your-own-transform.md Provided samples and examples: https://github.com/data-prep-kit/data-prep-kit/tree/dev/examples Parquet: https://parquet.apache.org/. - Source: dev.to / 27 days ago
Deliver nice ready-to-use data as duckdb, parquet and csv. - Source: dev.to / about 1 month ago
Push the dataset to hugging face in parquet format. - Source: dev.to / 6 months ago
It's this kind of certainty that underscores the vital role of the Apache Software Foundation (ASF). Many first encounter Apache through its pioneering project, the open-source web server framework that remains ubiquitous in web operations today. The ASF was initially created to hold the intellectual property and assets of the Apache project, and it has since evolved into a cornerstone for open-source projects... - Source: dev.to / 11 months ago
These facts only hold when the size of your payload and the number of connections remain relatively small. This easily jumps out the window with ever-increasing load parameters. The threshold is, unfortunately, rather low at a high number of connections and increased payload sizes. Modern large-scale micro-services will easily have over 100 running instances at medium scale. And since most instances employ some... - Source: dev.to / 3 months ago
The LMS Moodle Operator serves as a meta-operator, orchestrating the deployment and management of Moodle instances in Kubernetes. It handles the entire stack required to run Moodle, including components like Postgres, Keydb, NFS-Ganesha, and Moodle itself. Each of these components has its own Kubernetes Operator, ensuring seamless integration and management. - Source: dev.to / about 1 year ago
Congrats on the funding and getting production ready, it's good that KeyDB (and Redis) get some competition. https://docs.keydb.dev/ Open question, how does Dragonfly differ from KeyDB? - Source: Hacker News / about 2 years ago
See: Distroless images[0] This is one of the huge benefits of recent systems languages like go and rust -- they compile to single binaries so you can use things like scratch[1] containers. You may have to fiddle with gnu libc/musl libc (usually when getaddrinfo is involved/dns etc), but once you're done with it, packaging is so easy. Even languages like Node (IMO the most progressive of the scripting languages)... - Source: Hacker News / almost 3 years ago
Interesting project. Very similar to KeyDB [1] which also developed a multi-threaded scale-up approach to Redis. It's since been acquired by Snapchat. There's also Aerospike [2] which has developed a lot around low-latency performance. 1. https://docs.keydb.dev/ 2. https://aerospike.com/. - Source: Hacker News / almost 3 years ago
Apache Arrow - Apache Arrow is a cross-language development platform for in-memory data.
Redis - Redis is an open source in-memory data structure project implementing a distributed, in-memory key-value database with optional durability.
Apache Spark - Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
MongoDB - MongoDB (from "humongous") is a scalable, high-performance NoSQL database.
Apache Ignite - high-performance, integrated and distributed in-memory platform for computing and transacting on...
Amazon S3 - Amazon S3 is an object storage service where users can store data from their business on a safe, cloud-based platform. Amazon S3 operates in 54 availability zones within 18 geographic regions and 1 local region.