
Delta Lake VS ClickHouse

Compare Delta Lake and ClickHouse and see how they differ

Delta Lake

Application and Data, Data Stores, and Big Data Tools

ClickHouse

ClickHouse is an open-source column-oriented database management system that allows generating analytical data reports in real time.
  • Delta Lake landing page, 2023-08-26
  • ClickHouse landing page, 2019-06-18

Delta Lake videos

A Thorough Comparison of Delta Lake, Iceberg and Hudi

More videos:

  • Tutorial - Delta Lake for apache Spark | How does it work | How to use delta lake | Delta Lake for Spark ACID
  • Review - ACID ORC, Iceberg, and Delta Lake—An Overview of Table Formats for Large Scale Storage and Analytics

ClickHouse videos

No ClickHouse videos yet. You could help us improve this page by suggesting one.


Category Popularity

0-100% (relative to Delta Lake and ClickHouse)

Category               Delta Lake   ClickHouse
Development            100%         0%
Databases              16%          84%
Data Dashboard         100%         0%
Relational Databases   0%           100%


Reviews

These are some of the external sources and on-site user reviews we've used to compare Delta Lake and ClickHouse

Delta Lake Reviews

We have no reviews of Delta Lake yet.

ClickHouse Reviews

  • Rockset, ClickHouse, Apache Druid, or Apache Pinot? Which is the best database for customer-facing analytics?
    ClickHouse is an open-source, column-oriented, distributed, and OLAP database that’s very easy to set up and maintain. “Because it’s columnar, it’s the best architectural approach for aggregations and for ‘sort by’ on more than one column. It also means that group by’s are very fast. It’s distributed, replication is asynchronous, and it’s OLAP—which means it’s meant for...
    Source: embeddable.com
  • ClickHouse vs TimescaleDB
    Recently, TimescaleDB published a blog comparing ClickHouse & TimescaleDB using timescale/tsbs, a timeseries benchmarking framework. I have some experience with PostgreSQL and ClickHouse but never got the chance to play with TimescaleDB. Some of the claims about TimescaleDB made in their post are very bold, that made me even more curious. I thought it’d be a great...
  • 20+ MongoDB Alternatives You Should Know About
    ClickHouse may be a great contender for moving analytical workloads from MongoDB. Much faster, and with JSON support and Nested Data Structures, it can be great choice for storing and analyzing document data.
    Source: www.percona.com
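
The first review's point about columnar storage and fast group-bys can be illustrated with a toy sketch. This is not how ClickHouse is implemented; it only shows, under simplified assumptions, why a column layout lets an aggregation read just the columns it needs. The sample events and the helper names `to_columns` and `group_sum` are hypothetical.

```python
from collections import defaultdict

# Hypothetical click events stored row by row; field names are illustrative.
rows = [
    {"product": "a", "country": "US", "clicks": 3},
    {"product": "b", "country": "DE", "clicks": 5},
    {"product": "a", "country": "DE", "clicks": 2},
]


def to_columns(records):
    """Pivot a list of row dicts into one list of values per column."""
    columns = defaultdict(list)
    for record in records:
        for name, value in record.items():
            columns[name].append(value)
    return dict(columns)


def group_sum(columns, key, value):
    """Sum `value` per distinct `key`, touching only those two columns."""
    totals = defaultdict(int)
    for k, v in zip(columns[key], columns[value]):
        totals[k] += v
    return dict(totals)


columns = to_columns(rows)
# The "country" column is never read by this aggregation.
clicks_by_product = group_sum(columns, "product", "clicks")
print(clicks_by_product)
```

In a row layout the same query would have to walk every full record, including fields it does not use; in the column layout the unused `country` values are simply skipped.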

Social recommendations and mentions

ClickHouse appears to be somewhat more popular than Delta Lake: we have tracked 44 links to it since March 2021, compared with 31 links to Delta Lake. We track product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.

Delta Lake mentions (31)

  • Delta Lake vs. Parquet: A Comparison
    Delta is pretty great, lets you do upserts into tables in Databricks much easier than without it. I think the website is here: https://delta.io. - Source: Hacker News / 4 months ago
  • Getting Started with Flink SQL, Apache Iceberg and DynamoDB Catalog
    Apache Iceberg is one of the three types of lakehouse, the other two are Apache Hudi and Delta Lake. - Source: dev.to / 5 months ago
  • [D] Is there other better data format for LLM to generate structured data?
    The Apache Spark / Databricks community prefers Apache Parquet or Linux Foundation's delta.io over JSON. - Source: 6 months ago
  • Databricks Strikes $1.3B Deal for Generative AI Startup MosaicML
    Databricks provides Jupyter lab like notebooks for analysis and ETL pipelines using spark through pyspark, sparkql or scala. I think R is supported as well but it doesn't interop as well with their newer features as well as python and SQL do. It interfaces with cloud storage backend like S3 and offers some improvements to the parquet format of data querying that allows for updating, ordering and merged through... - Source: Hacker News / 11 months ago
  • The "Big Three's" Data Storage Offerings
    Structured, Semi-structured and Unstructured can be stored in one single format, a lakehouse storage format like Delta, Iceberg or Hudi (assuming those don't require low-latency SLAs like subsecond). - Source: 12 months ago

ClickHouse mentions (44)

  • Simplified API Creation and Management: ClickHouse to APISIX Integration Without Code
    In the world of data management and web services, creating and managing APIs can often be a complex and time-consuming task. However, with the right tools, this process can be significantly simplified. In this article, we will explore how to create APIs for fetching data from ClickHouse tables without writing any code and manage these APIs using APISIX. ClickHouse, a fast and open-source columnar database... - Source: dev.to / 4 days ago
  • The 2024 Web Hosting Report
    For the third, examples here might be analytics plugins in specialized databases like Clickhouse, data-transformations in places like your ETL pipeline using Airflow or Fivetran, or special integrations in your authentication workflow with Auth0 hooks and rules. - Source: dev.to / 3 months ago
  • Choosing Between a Streaming Database and a Stream Processing Framework in Python
    Online analytical processing (OLAP) databases like Apache Druid, Apache Pinot, and ClickHouse shine in addressing user-initiated analytical queries. You might write a query to analyze historical data to find the most-clicked products over the past month efficiently using OLAP databases. When contrasting with streaming databases, they may not be optimized for incremental computation, leading to challenges in... - Source: dev.to / 4 months ago
  • Real-Time Data Enrichment and Analytics With RisingWave and ClickHouse
    To achieve seamless real-time data ingestion, transformation, and analytics, a powerful combination to explore is RisingWave and ClickHouse. RisingWave is a PostgreSQL-compatible database specifically designed for stream processing. It excels at ingesting real-time data streams, performing diverse transformations, and enabling instant querying of results. ClickHouse® is a high-performance, column-oriented SQL... - Source: dev.to / 5 months ago
  • Ask HN: Is there a Hacker News takeout to export my comments / upvotes, etc.?
    You can export the whole dataset as described here: https://github.com/ClickHouse/ClickHouse/issues/29693
        curl https://clickhouse.com/ | sh.
    - Source: Hacker News / 6 months ago

What are some alternatives?

When comparing Delta Lake and ClickHouse, you can also consider the following products

Google BigQuery - A fully managed data warehouse for large-scale data analytics.

PostgreSQL - PostgreSQL is a powerful, open source object-relational database system.

Amazon SageMaker - Amazon SageMaker provides every developer and data scientist with the ability to build, train, and deploy machine learning models quickly.

MySQL - The world's most popular open source database

GeoSpock - GeoSpock is the platform for data lake management, providing a unified view of the data assets within an organization and making it easily accessible.

Apache Doris - Apache Doris is an open-source real-time data warehouse for big data analytics.