Recommended and mentioned products
lakeFS is an open-source tool that transforms your object storage into Git-like repositories. Start managing data the way you manage your code.
How are you incrementally testing your data pipelines as you... about 4 months ago: I mean, if you're ready to adopt a new framework into your ecosystem, this is one of the major use cases for lakeFS.
The Shared POSIX File System for the Cloud
An open-source distributed object storage service about 4 months ago: For this use case, I like JuiceFS better. Links: https://juicefs.com/en/ and https://github.com/juicedata/juicefs (I am not affiliated with them, just a regular user).
SeaweedFS is a simple and highly scalable distributed file system to store and serve billions of files fast! SeaweedFS object store has O(1) disk seek and SeaweedFS Filer supports cross-cluster replication, POSIX, S3 API, …
Open source cloud file system. Posix, HDFS and S3 compatible about 2 months ago: Adopted SeaweedFS a few months back. Never looked back since then. It's fast even on HDDs. https://github.com/seaweedfs/seaweedfs#introduction
MinIO is an open-source minimal cloud storage server.
Query Real Time Data in Kafka Using SQL about 7 days ago: With this configuration, Docker initiates a demo cluster with all RisingWave components, including the frontend node, compute node, metadata node, and MinIO. The workload generator will start to generate random mock data and feed it into Kafka topics. In this demo cluster, data of materialized views will be stored in the MinIO instance.