Based on our record, Minio seems to be a lot more popular than Apache Calcite. While we know about 156 links to Minio, we've tracked only 12 mentions of Apache Calcite. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
> Make diff work on more than just SQLite. Another way of doing this that I've been wanting to do for a while is to implement the DIFF operator in Apache Calcite[0]. Using Calcite, DIFF could be implemented as rewrite rules to generate the appropriate SQL to be directly executed against the database or the DIFF operator can be implemented outside of the database (which the original paper shows is more efficient).... - Source: Hacker News / 11 months ago
Use a SQL Parser like sqlglot or Apache Calcite to compile user's query into an AST. Source: about 1 year ago
One parser I think deserves a mention is the one from Apache Calcite[0]. Calcite does more than parsing, there are a number of users who pick up Calcite just for the parser. While the default parser attempts to adhere strictly to the SQL standard, of interest is also the Babel parser, which aims to be as permissive as possible in accepting different dialects of SQL. Disclaimer: I am on the PMC of Apache Calcite,... - Source: Hacker News / almost 2 years ago
Apache Calcite can do this, though it's not a beginner-friendly task: https://calcite.apache.org/. - Source: Hacker News / almost 2 years ago
You should look at Apache Calcite[0]. Like OctoSQL, you can join data from different data sources. It's also relatively easy to add your own data sources ("adapters" in Calcite lingo) and rules to efficiently query those sources. Calcite already has adapters that do things like read from HTML tables over HTTP, files on your file system, running processes, etc. This is in addition to connecting to a bunch of... - Source: Hacker News / almost 2 years ago
The meta-data and model artifacts from experiment tracking can contain large amounts of data, such as the training model files, data files, metrics and logs, visualizations, configuration files, checkpoints, etc. In cases where the experiment tool doesn't support data storage, an alternative option is to track the training and validation data versions per experiment. They use remote data storage systems such as S3... - Source: dev.to / 15 days ago
> When it gets too out of hand, people will paper it over with a new, simpler abstraction layer, and the process starts again, only with a layer of garbage spaghetti underneath. I'm pretty happy that there are S3 compatible stores that you can host yourself, that aren't insanely complex. MinIO: https://min.io/ SeaweedFS: https://github.com/seaweedfs/seaweedfs Of course, many will prefer hosted/managed solutions... - Source: Hacker News / 28 days ago
Here are the basic steps to getting a minio tenant deployed inot kubernetes. There are some pre-requisites tasks to be deployed (and will not be covered in this article) including. - Source: dev.to / 2 months ago
I'd throw minio [1] in the list there as well for homelab k8s object storage. [1] https://min.io/. - Source: Hacker News / 5 months ago
Can you just append the data to a blob using something like the s3 blob api? AWS, Azure and Minio https://min.io/ all support it. That way you don't have to reinvent the wheel. Source: 10 months ago
Apache Drill - Schema-Free SQL Query Engine for Hadoop and NoSQL
Ceph - Ceph is a distributed object store and file system designed to provide excellent performance...
Presto DB - Distributed SQL Query Engine for Big Data (by Facebook)
Google Cloud Storage - Google Cloud Storage offers developers and IT organizations durable and highly available object storage.
Superintendent.app - Superintendent.app is a Desktop app that enables you to write SQL on CSV files.
Azure Blob Storage - Use Azure Blob Storage to store all kinds of files. Azure hot, cool, and archive storage is reliable cloud object storage for unstructured data