No OctoSQL videos yet. You could help us improve this page by suggesting one.
OctoSQL might be a bit more popular than Hadoop. We know about 23 links to it since March 2021 and only 23 links to Hadoop. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
Over the years, Indian developers have played increasingly vital roles in many international projects. From contributions to frameworks such as Kubernetes and Apache Hadoop to the emergence of homegrown platforms like OpenStack India, India has steadily carved out a global reputation as a powerhouse of open source talent. - Source: dev.to / 1 day ago
One of the key attributes of Apache License 2.0 is its flexible nature. Permitting use in both proprietary and open source environments, it has become the go-to choice for innovative projects ranging from the Apache HTTP Server to large-scale initiatives like Apache Spark and Hadoop. This flexibility is not solely legal; it is also philosophical. The license is designed to encourage transparency and maintain a... - Source: dev.to / about 2 months ago
Apache Hadoop is more than just software—it’s a full-fledged ecosystem built on the principles of open collaboration and decentralized governance. Born out of a need to process vast amounts of information efficiently, Hadoop uses a distributed file system and the MapReduce programming model to enable scalable, fault-tolerant computing. Central to its success is a diverse ecosystem that includes influential... - Source: dev.to / about 2 months ago
Navya: Designed to streamline administrative processes in educational institutions, Navya continues to demonstrate the power of open source in addressing local needs. Additionally, India’s vibrant tech communities are well represented on platforms like GitHub and SourceForge. These platforms host numerous Indian-led projects and serve as collaborative hubs for developers across diverse technology landscapes.... - Source: dev.to / 2 months ago
The rise of big data has seen Java arise as a crucial player in this domain. Tools like Hadoop and Apache Spark are built using Java, enabling businesses to process and analyze massive datasets efficiently. Java’s scalability and performance are critical for big data results that demand high trustability. - Source: dev.to / 5 months ago
This looks extremely cool. This is basically incremental view maintenance in databases, a problem that almost everybody (I think) has when using SQL databases and wanting to do some derived views for more performant access patterns. Importantly, they seem to support a wide breath of SQL operators, and it's open-source! There's already a bunch of tools in this area: 1. Materialize[0], which afaik is more... - Source: Hacker News / 7 months ago
OctoSQL[0] or DuckDB[1] will most likely be much simpler, while going through 10 GB of JSON in a couple seconds at most. Disclaimer: author of OctoSQL [0]: https://github.com/cube2222/octosql. - Source: Hacker News / about 2 years ago
This is really cool! With their Postgres scanner[0] you can now easily query multiple datasources using SQL and join between them (i.e. Postgres table with JSON file). Something I strived to build with OctoSQL[1] before. It's amazing to see how quickly DuckDB is adding new features. Not a huge fan of C++, which is right now used for authoring extensions, it'd be really cool if somebody implemented a Rust extension... - Source: Hacker News / about 2 years ago
Congrats on the Show HN! It's great to see more tools in this area (querying data from various sources in-place) and the Lambda use case is a really cool idea! I've recently done a bunch of benchmarking, including ClickHouse Local and the usage was straightforward, with everything working as it's supposed to. Just to comment on the performance area though, one area I think ClickHouse could still possibly improve... - Source: Hacker News / over 2 years ago
SPyQL is really cool and its design is very smart, with it being able to leverage normal Python functions! As far as similar tools go, I recommend taking a look at DataFusion[0], dsq[1], and OctoSQL[2]. DataFusion is a very (very very) fast command-line SQL engine but with limited support for data formats. Dsq is based on SQLite which means it has to load data into SQLite first, but then gives you the whole breath... - Source: Hacker News / over 2 years ago
Apache Spark - Apache Spark is an engine for big data processing, with built-in modules for streaming, SQL, machine learning and graph processing.
Materialize - A Streaming Database for Real-Time Applications
PostgreSQL - PostgreSQL is a powerful, open source object-relational database system.
Steampipe - Steampipe: select * from cloud; The extensible SQL interface to your favorite cloud APIs select * from AWS, Azure, GCP, Github, Slack etc.
Apache Storm - Apache Storm is a free and open source distributed realtime computation system.
LNAV - The Log File Navigator (lnav) is an advanced log file viewer for the console.