Software Alternatives, Accelerators & Startups

Apache Calcite VS Presto DB

Compare Apache Calcite VS Presto DB and see what are their differences

Apache Calcite logo Apache Calcite

Relational Databases

Presto DB logo Presto DB

Distributed SQL Query Engine for Big Data (by Facebook)
  • Apache Calcite Landing page
    Landing page //
    2022-04-30
  • Presto DB Landing page
    Landing page //
    2023-03-18

Apache Calcite features and specs

No features have been listed yet.

Presto DB features and specs

  • High-Performance Query Engine
    Presto is designed for high-performance querying, capable of performing complex analytics and large-scale data processing at interactive speeds.
  • Distributed SQL Query Engine
    Presto can scale out to large clusters of machines, allowing for efficient distribution of queries over multiple servers to handle big data workloads.
  • Versatility
    Supports querying data from multiple data sources such as Hadoop, relational databases, NoSQL databases, and cloud object storage within a single query.
  • ANSI-SQL Compatibility
    Presto supports ANSI SQL, making it easier for users familiar with SQL to adapt and write queries without a steep learning curve.
  • Open Source
    Presto is an open-source project, which means it benefits from continuous community contributions and improvements, keeping it up-to-date and robust.
  • Extensible
    Presto's architecture is designed to be extensible, allowing users to add custom functions and connectors, tailored to specific needs.

Possible disadvantages of Presto DB

  • Resource Intensive
    High performance comes with significant resource requirements, necessitating robust infrastructure to realize its full potential.
  • Complex Configuration
    Setting up and configuring Presto can be complex and time-consuming, often requiring expertise and an understanding of its various components.
  • Limited Support for Transactions
    Presto is primarily designed for reading data and performing analytics, and it has limited support for transactional processing compared to traditional relational databases.
  • Community Support
    While it has a vibrant open-source community, users may find the support less comprehensive than that provided by commercial enterprise solutions.
  • Latency for Small Queries
    Designed for big data and complex queries, Presto may exhibit higher latency for small, simple queries compared to specialized databases optimized for such use cases.
  • Maintenance Overhead
    Managing and maintaining a Presto cluster can be labor-intensive, requiring ongoing tuning and maintenance to ensure optimal performance and reliability.

Apache Calcite videos

The Evolution of Apache Calcite and its Community - A Discussion with Julian Hyde

More videos:

  • Review - Building modern SQL query optimizers with Apache Calcite - Vladimir Ozerov

Presto DB videos

No Presto DB videos yet. You could help us improve this page by suggesting one.

Add video

Category Popularity

0-100% (relative to Apache Calcite and Presto DB)
Databases
48 48%
52% 52
Data Dashboard
7 7%
93% 93
Database Tools
8 8%
92% 92
Big Data Analytics
0 0%
100% 100

User comments

Share your experience with using Apache Calcite and Presto DB. For example, how are they different and which one is better?
Log in or Post with

Social recommendations and mentions

Apache Calcite might be a bit more popular than Presto DB. We know about 12 links to it since March 2021 and only 10 links to Presto DB. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.

Apache Calcite mentions (12)

  • Data diffs: Algorithms for explaining what changed in a dataset (2022)
    > Make diff work on more than just SQLite. Another way of doing this that I've been wanting to do for a while is to implement the DIFF operator in Apache Calcite[0]. Using Calcite, DIFF could be implemented as rewrite rules to generate the appropriate SQL to be directly executed against the database or the DIFF operator can be implemented outside of the database (which the original paper shows is more efficient).... - Source: Hacker News / almost 2 years ago
  • How to manipulate SQL string programmatically?
    Use a SQL Parser like sqlglot or Apache Calcite to compile user's query into an AST. Source: about 2 years ago
  • Parsing SQL
    One parser I think deserves a mention is the one from Apache Calcite[0]. Calcite does more than parsing, there are a number of users who pick up Calcite just for the parser. While the default parser attempts to adhere strictly to the SQL standard, of interest is also the Babel parser, which aims to be as permissive as possible in accepting different dialects of SQL. Disclaimer: I am on the PMC of Apache Calcite,... - Source: Hacker News / over 2 years ago
  • Semantic Diff for SQL
    Apache Calcite can do this, though it's not a beginner-friendly task: https://calcite.apache.org/. - Source: Hacker News / almost 3 years ago
  • OctoSQL allows you to join data from different sources using SQL
    You should look at Apache Calcite[0]. Like OctoSQL, you can join data from different data sources. It's also relatively easy to add your own data sources ("adapters" in Calcite lingo) and rules to efficiently query those sources. Calcite already has adapters that do things like read from HTML tables over HTTP, files on your file system, running processes, etc. This is in addition to connecting to a bunch of... - Source: Hacker News / almost 3 years ago
View more

Presto DB mentions (10)

  • Data Warehouses and Data Lakes: Understanding Modern Data Storage Paradigms 📦
    Follow Presto at Official Website, Linkedin, Youtube, and Slack channel to join the community. - Source: dev.to / 8 days ago
  • Introduction to Presto: Open Source SQL Query Engine that's changing Big Data Analytics
    In today's data-driven world, organizations face a constant challenge: how to analyse massive datasets quickly and efficiently without moving data between disparate systems. Presto, an open-source distributed SQL query engine that's revolutionizing how we approach big data analytics. - Source: dev.to / 8 days ago
  • Twitter's 600-Tweet Daily Limit Crisis: Soaring GCP Costs and the Open Source Fix Elon Musk Ignored
    Presto: Presto is an open-source distributed SQL query engine that enables querying data from various sources. It provides fast and interactive analytics capabilities, supporting a wide range of data formats and integration with different storage systems. - Source: dev.to / 23 days ago
  • Using IRIS and Presto for high-performance and scalable SQL queries
    The rise of Big Data projects, real-time self-service analytics, online query services, and social networks, among others, have enabled scenarios for massive and high-performance data queries. In response to this challenge, MPP (massively parallel processing database) technology was created, and it quickly established itself. Among the open-source MPP options, Presto (https://prestodb.io/) is the best-known... - Source: dev.to / 3 months ago
  • Parsing logs from multiple data sources with Ahana and Cube
    Presto is an open-source distributed SQL query engine, originally developed at Facebook, now hosted under the Linux Foundation. It connects to multiple databases or other data sources (for example, Amazon S3). We can use a Presto cluster as a single compute engine for an entire data lake. - Source: dev.to / almost 3 years ago
View more

What are some alternatives?

When comparing Apache Calcite and Presto DB, you can also consider the following products

Apache Drill - Schema-Free SQL Query Engine for Hadoop and NoSQL

Looker - Looker makes it easy for analysts to create and curate custom data experiences—so everyone in the business can explore the data that matters to them, in the context that makes it truly meaningful.

Open Data Hub - OpenDataHub

Google BigQuery - A fully managed data warehouse for large-scale data analytics.

SQLite - SQLite Home Page

Jupyter - Project Jupyter exists to develop open-source software, open-standards, and services for interactive computing across dozens of programming languages. Ready to get started? Try it in your browser Install the Notebook.