Software Alternatives & Reviews

Show HN: ClickHouse-local – a small tool for serverless data analytics

  1. ClickHouse is an open-source column-oriented database management system that allows generating analytical data reports in real time.
    Pricing:
    • Open Source
    What happens if you download the install script and edit it? I don't have a Windows machine at hand right now, so I can't check myself. Something like this: `curl https://clickhouse.com/ --output install.sh`

    #Databases #Relational Databases #Data Warehousing 43 social mentions

  2. OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL. - cube2222/octosql
    Congrats on the Show HN! It's great to see more tools in this area (querying data from various sources in place), and the Lambda use case is a really cool idea! I recently did a bunch of benchmarking that included ClickHouse Local, and usage was straightforward, with everything working as it's supposed to. One area where I think ClickHouse could still improve, at least versus OctoSQL[0], is the JSON data source, which seems slower, especially when only a small part of each JSON object is used. If only a single field of many is used, OctoSQL lazily parses only that field and skips the others, which yields non-trivial performance gains on big JSON files with small queries. For a query like `SELECT COUNT(*), AVG(overall) FROM books.json` on the Amazon Review Dataset, OctoSQL is twice as fast (3s vs 6s). That's a minor thing though (OctoSQL will slow down for more complicated queries, while for ClickHouse decoding the input is and remains the bottleneck). [0]: https://github.com/cube2222/octosql

    #Databases #Big Data #Relational Databases 22 social mentions
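To make the query in the OctoSQL comment concrete, here is a minimal Python sketch of what `SELECT COUNT(*), AVG(overall) FROM books.json` computes over newline-delimited JSON. It is not how ClickHouse or OctoSQL implement it: the function name `count_and_avg` and the sample records are made up for illustration, and the file is assumed to hold one JSON object per line. The sketch decodes every field of every record, which is the eager approach; the lazy parsing described in the comment would skip the unused fields instead.

```python
import io
import json
from typing import IO, Optional, Tuple


def count_and_avg(lines: IO[str], field: str = "overall") -> Tuple[int, Optional[float]]:
    """Return (COUNT(*), AVG(field)) over newline-delimited JSON records.

    Hypothetical helper for illustration only. Like SQL's AVG, records
    where the field is missing or null are left out of the average but
    still counted by COUNT(*).
    """
    rows = 0
    total = 0.0
    non_null = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)  # eager: decodes every field, used or not
        rows += 1
        value = record.get(field)
        if value is not None:
            total += value
            non_null += 1
    avg = total / non_null if non_null else None
    return rows, avg


# Made-up sample records standing in for the real dataset.
sample = io.StringIO(
    '{"overall": 5.0, "reviewText": "great", "asin": "A1"}\n'
    '{"overall": 3.0, "reviewText": "ok", "asin": "B2"}\n'
    '{"reviewText": "no rating", "asin": "C3"}\n'
)
print(count_and_avg(sample))  # (3, 4.0)
```

Only `overall` is read here, yet `json.loads` still builds the full object for each line. That wasted work is what a lazy, single-field parse avoids on large files.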
