Software Alternatives, Accelerators & Startups

LakeFS VS GlusterFS

Compare LakeFS VS GlusterFS and see what are their differences

LakeFS logo LakeFS

lakeFS is an open-source tool that transforms your object storage to Git-like repositories. Start managing data the way you manage your code.

GlusterFS logo GlusterFS

GlusterFS is a scale-out network-attached storage file system.
  • LakeFS Landing page
    Landing page //
    2023-08-27
  • GlusterFS Landing page
    Landing page //
    2019-03-10

LakeFS videos

Getting Started With lakeFS

More videos:

  • Review - Get Ready for ML! Level Up Your Data Lake with Delta and lakeFS | Treeverse

GlusterFS videos

An Overview of GlusterFS Architecture Part 2 - Non-replicated Cluster

Category Popularity

0-100% (relative to LakeFS and GlusterFS)
Cloud Storage
13 13%
87% 87
Cloud Computing
13 13%
87% 87
File Sharing
100 100%
0% 0
Storage
7 7%
93% 93

User comments

Share your experience with using LakeFS and GlusterFS. For example, how are they different and which one is better?
Log in or Post with

Reviews

These are some of the external sources and on-site user reviews we've used to compare LakeFS and GlusterFS

LakeFS Reviews

4 Must-Have Open Source Solutions for Object Storage
LakeFS allows you to create a development environment where you can perform experiments and document them in a reproducible manner. Like Git, you can create commits and branches, making it possible for you to move along the timeline of your application development and try out new features in isolation. Amazingly, lakeFS performs all this without duplicating any data —...

GlusterFS Reviews

We have no reviews of GlusterFS yet.
Be the first one to post

Social recommendations and mentions

Based on our record, LakeFS should be more popular than GlusterFS. It has been mentiond 6 times since March 2021. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.

LakeFS mentions (6)

  • Ask HN: AWS S3, Cloudflare R2, GCS, Wasabi, or B2?
    I would add https://github.com/gaul/s3proxy to your list. - Source: Hacker News / 2 months ago
  • Dev / Stage / Prod is the wrong pattern for data pipelines
    * data state - this is contents of both your data and metadata at a given point in time. if your data doesn't fit into a single database, this can be difficult to manage. We use this technology to help us: https://lakefs.io/. - Source: Hacker News / 10 months ago
  • Dev / Stage / Prod is the wrong pattern for data pipelines
    Saltcured, find these comments super insightful! > Yeah, there's a lot of hidden magic/assumptions in having a "writable snapshot of a specific version" of production data. That's absolutely a huge assumption. This technology has been a game changer for us: https://lakefs.io/ > It becomes a headache when there is too much contention to use these sandboxes, or too much manual effort to reset them to a desired... - Source: Hacker News / 10 months ago
  • Using git to version control experimental data (not code)?
    You should not store your data in git itself, but rather use git to version your data sets. The currently best option for that is (IMHO) https://lakefs.io though there are a few others in various states of usability/maturity. Source: about 1 year ago
  • How are you incrementally testing your data pipelines as you develop them?
    I mean if you're ready to adopt a new framework into your ecosystem this is one of the major usecases for LakeFS. Source: over 1 year ago
View more

GlusterFS mentions (2)

  • [D] What are the compute options you've considered for your projects?
    I am a fan of Gearman to schedule and dispatch distributed jobs, Redis as a collaborative blackboard, and GlusterFS to share models across multiple systems and make bulk data available across the entire system (usually referenced in the blackboard as a pathname). Source: about 1 year ago
  • Gluster vs Oracle Gluster
    If you're not relying on support, then I would probably standardize on the latest packages available from gluster.org. Source: almost 3 years ago

What are some alternatives?

When comparing LakeFS and GlusterFS, you can also consider the following products

Seaweed FS - SeaweedFS is a simple and highly scalable distributed file system to store and serve billions of files fast! SeaweedFS object store has O(1) disk seek and SeaweedFS Filer supports cross-cluster replication, POSIX, S3 API, ,…

Ceph - Ceph is a distributed object store and file system designed to provide excellent performance...

Minio - Minio is an open-source minimal cloud storage server.

WekaFS - WekaFS is an extreme-performance parallel filesystem for Linux from WekaIO that works in AWS or on-prem on Industry-Standard Servers. WekaFS includes Enterprise features such as snapshots and tiering to S3 Object Stores.

rkt - App Container runtime

JuiceFS - The Shared POSIX File System for the Cloud