Based on our record, Materialize should be more popular than Apache Arrow. It has been mentiond 74 times since March 2021. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
I had no idea what Arrow is: https://arrow.apache.org or arrow-rs: https://github.com/apache/arrow-rs. - Source: Hacker News / about 2 months ago
- Open source: Pontoon is free to use by anyone Under the hood, we use Apache Arrow (https://arrow.apache.org/) to move data between sources and destinations. Arrow is very performant - we wanted to use a library that could handle the scale of moving millions of records per minute. In the shorter-term, there are several improvements we want to make, like:. - Source: Hacker News / 2 months ago
Apache Arrow : It contains a set of technologies that enable big data systems to process and move data fast. - Source: dev.to / 10 months ago
One of the main selling points of Polars over similar solutions such as Pandas is performance. Polars is written in highly optimized Rust and uses the Apache Arrow container format. - Source: dev.to / 11 months ago
Kotlin DataFrame v0.14 comes with improvements for reading Apache Arrow format, especially loading a DataFrame from any ArrowReader. This improvement can be used to easily load results from analytical databases (such as DuckDB, ClickHouse) directly into Kotlin DataFrame. - Source: dev.to / over 1 year ago
Did I miss in the article where OP reveals the magic database that actually does this? 3rd party solutions like https://readyset.io/ and https://materialize.com/ exist specifically because databases donโt actually have what we all want materialized views to be. - Source: Hacker News / about 1 month ago
This triggered some associations for me. Strongest was Cells[0], a library for Common Lisp CLOS. The earliest reference I can find is 2002[1], making it over 20 years old. Second is incremental view maintenance systems like Feldera[2] or Materialize[3]. These use sophisticated theories (z-sets and differential dataflow) to apply efficient updates over sets of data, which generalizes the case of single variables.... - Source: Hacker News / 4 months ago
It's hard to write something that is both accessible and well-motivated. The best uses of category theory is when the morphisms are far more exotic than "regular functions". E.g. It would be nice to describe a circuit of live queries (like https://materialize.com/ stuff) with proper caching, joins, etc. Figuring this out is a bit of an open problem. Haskell's standard library's Monad and stuff are watered down to... - Source: Hacker News / 10 months ago
> [...] `https://materialize.com/` to solve their memory issues [...] Disclaimer: I work at Materialize Recently there have been major improvements in Materialize's memory usage as well as using disk to swap out some data. I find it pretty easy to hook up to Postgres/MySQL/Kafka instances: https://materialize.com/blog/materialize-emulator/. - Source: Hacker News / 11 months ago
I agree. So many disparate solutions. The streaming sql primitives are by themselves good enough (e.g. `tumble`, `hop` or `session` windows), but the infrastructural components are always rough in real life use cases. Crossing fingers for solutions like `https://github.com/feldera/feldera` to solve their memory issues, or `https://clickhouse.com/docs/en/materialized-view` to solve reliable streaming consumption.... - Source: Hacker News / 11 months ago
Redis - Redis is an open source in-memory data structure project implementing a distributed, in-memory key-value database with optional durability.
Apache Flink - Flink is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations.
Apache Parquet - Apache Parquet is a columnar storage format available to any project in the Hadoop ecosystem.
OctoSQL - OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL. - cube2222/octosql
DuckDB - DuckDB is an in-process SQL OLAP database management system
RisingWave - RisingWave is a stream processing platform that utilizes SQL to enhance data analysis, offering improved insights on real-time data.