
Apache Doris VS InfluxData

Compare Apache Doris and InfluxData to see how they differ

Apache Doris

Apache Doris is an open-source real-time data warehouse for big data analytics.

InfluxData

Scalable datastore for metrics, events, and real-time analytics.

Apache Doris features and specs

  • High Performance
    Apache Doris is designed to deliver high query performance, especially for aggregate queries, due to its columnar storage and vectorized execution engine.
  • Real-time Analytics
    Supports real-time data analytics with low latency, thanks to its efficient data ingestion processes and real-time data update capabilities.
  • Unified Analytics
    Provides a unified platform that supports both real-time and batch data processing, offering flexibility for different analytical workloads.
  • Ease of Use
    Features a SQL-like interface, which makes it accessible for users familiar with SQL, reducing the learning curve.
  • Scalability
    Can scale out horizontally, allowing it to handle increasing volumes of data and user queries by adding more nodes to the cluster.
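
As a rough illustration of why columnar storage helps aggregate queries, the sketch below compares a row-oriented and a column-oriented layout for the same data. It is a toy example in plain Python, not Apache Doris code; the sample records and the function names (`to_columns`, `sum_clicks_row_store`, `sum_clicks_column_store`) are hypothetical and chosen only for this illustration.

```python
from array import array

# Hypothetical sample data in a row-oriented layout: each record is kept together.
ROWS = [
    {"region": "eu", "clicks": 3, "cost": 1.5},
    {"region": "us", "clicks": 5, "cost": 2.0},
    {"region": "eu", "clicks": 2, "cost": 0.5},
]


def to_columns(rows):
    """Pivot row records into one contiguous sequence per column."""
    return {
        "region": [r["region"] for r in rows],
        "clicks": array("q", (r["clicks"] for r in rows)),
        "cost": array("d", (r["cost"] for r in rows)),
    }


def sum_clicks_row_store(rows):
    """Aggregate over rows: every record is visited, including unused fields."""
    total = 0
    for record in rows:
        total += record["clicks"]
    return total


def sum_clicks_column_store(columns):
    """Aggregate over one column: only the 'clicks' values are read, as a batch."""
    return sum(columns["clicks"])


if __name__ == "__main__":
    columns = to_columns(ROWS)
    print(sum_clicks_row_store(ROWS), sum_clicks_column_store(columns))
```

Both functions return the same total; the point is that the columnar version reads a single densely packed column instead of every full record, which is the property that aggregate-heavy analytical engines are built around.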

Possible disadvantages of Apache Doris

  • Ecosystem Integration
    While improving, the ecosystem isn't as mature as older database management systems, which might pose integration challenges with certain tools.
  • Community Support
    Being a relatively newer project, it may not have as large a community or as extensive third-party support as more established databases.
  • Complexity in Setup
    Initial setup and configuration can be complex, especially for users not already familiar with similar distributed systems.
  • Limited Use Cases
    Because it is optimized specifically for online analytical processing (OLAP), it may not suit every database workload, particularly transactional (OLTP) use cases.
  • Features Maturity
    Some features may lack the robustness found in longer-established, widely adopted database systems, so they require careful evaluation against project needs.

InfluxData features and specs

  • High Performance
    InfluxData's InfluxDB is designed to handle high write and query loads, making it suitable for time-series data and real-time applications.
  • Open-Source
    The core InfluxDB product is open-source, allowing for transparency, community contributions, and the option to self-host the database.
  • Scalability
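    InfluxDB offers horizontal scalability through clustering, enabling users to handle increasing volumes of data efficiently. Note that clustering has generally been part of the commercial offerings rather than the open-source edition.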
  • Built-In Data Processing
    InfluxData offers integrated tools for data processing and scripting, such as Kapacitor for real-time processing and Flux for advanced querying.
  • Rich Ecosystem
    InfluxData provides a comprehensive ecosystem including Telegraf for data collection, Chronograf for visualization, and Kapacitor for alerting and processing.
  • Time-Series Focused
    InfluxDB is optimized for time-series data, offering specialized features like time-based retention policies, continuous queries, and downsampling.
  • Easy Integration
    InfluxDB integrates well with many third-party data visualization and monitoring tools such as Grafana, making it easier to build end-to-end solutions.
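
To make the time-series features above more concrete, here is a minimal sketch of what downsampling and a time-based retention policy do conceptually. It is plain Python and does not use InfluxDB or its client libraries; the function names (`downsample_mean`, `apply_retention`) and the `(timestamp, value)` point format are assumptions made for this example only.

```python
from collections import defaultdict


def downsample_mean(points, window_s):
    """Collapse (unix_seconds, value) points into one mean value per time window.

    Returns a list of (window_start, mean) tuples sorted by window start.
    """
    windows = defaultdict(list)
    for ts, value in points:
        window_start = ts - (ts % window_s)  # align to the start of the window
        windows[window_start].append(value)
    return [
        (start, sum(values) / len(values))
        for start, values in sorted(windows.items())
    ]


def apply_retention(points, now_s, max_age_s):
    """Keep only points that are at most max_age_s seconds older than now_s."""
    return [(ts, value) for ts, value in points if now_s - ts <= max_age_s]


if __name__ == "__main__":
    raw = [(0, 1.0), (30, 3.0), (60, 5.0), (90, 7.0)]
    print(downsample_mean(raw, window_s=60))
    print(apply_retention(raw, now_s=100, max_age_s=50))
```

In a real time-series database these operations run inside the storage engine on a schedule; the sketch only shows the shape of the transformation applied to the data.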

Possible disadvantages of InfluxData

  • Complexity
    The comprehensive features and tools in the InfluxData ecosystem can result in a steeper learning curve, especially for novices.
  • Cost
    While the open-source version is free, the enterprise and cloud-hosted versions come with a cost, which can be significant for small to mid-sized businesses.
  • Resource Intensive
    InfluxDB can be resource-intensive, especially under high loads, requiring significant hardware resources for optimal performance.
  • Limited SQL Support
    InfluxDB doesn’t fully support SQL, which can be a hurdle for users accustomed to traditional relational databases. It uses its own query languages like InfluxQL and Flux.
  • Fragmented Documentation
    Some users find the documentation fragmented or lacking in depth, which can make troubleshooting and advanced usage more challenging.
  • Data Backup and Restore
    Managing backups and restores in InfluxDB can be intricate and may require additional effort and tools to ensure data integrity and availability.

InfluxData videos

Barbara Nelson [InfluxData] | Best Practices for Data Ingestion into InfluxDB

Category Popularity

0-100% (relative to Apache Doris and InfluxData)

  Category               Apache Doris   InfluxData
  Databases              36%            64%
  Relational Databases   100%           0%
  Time Series Database   0%             100%
  Data Warehousing       100%           0%

Reviews

These are some of the external sources and on-site user reviews we've used to compare Apache Doris and InfluxData

Apache Doris Reviews

Log analysis: Elasticsearch vs Apache Doris
If you are looking for an efficient log analytics solution, Apache Doris is friendly to anyone equipped with SQL knowledge; if you find friction with the ELK stack, try Apache Doris, which provides better schema-free support, enables faster data writing and queries, and brings a much lighter storage burden.

InfluxData Reviews

ReductStore vs. MinIO & InfluxDB on LTE Network: Who Really Wins the Speed Race?
Maintaining consistency between multiple databases, like MinIO and InfluxDB, adds a layer of complexity. In our setup, MinIO, used for blob storage, is linked to data points in InfluxDB via its filename. Any inconsistencies or mismatches between the two could potentially result in data loss. Furthermore, we need to query both databases, which is quite inefficient. Lastly,...
Apache Druid vs. Time-Series Databases
We occasionally get questions regarding how Apache Druid differs from time-series databases (TSDB) such as InfluxDB or Prometheus, and when to use each technology. This short post serves to help answer these questions.
Source: imply.io
4 Best Time Series Databases To Watch in 2019
InfluxDB is part of the TICK stack: Telegraf, InfluxDB, Chronograf and Kapacitor. InfluxData provides, out of the box, a visualization tool (that can be compared to Grafana), a data processing engine that binds directly with InfluxDB, and a set of more than 50 agents that can collect real-time metrics from a lot of different data sources.
Source: medium.com

Social recommendations and mentions

Based on our records, Apache Doris appears to be more popular than InfluxData. It has been mentioned 6 times since March 2021. We track product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.

Apache Doris mentions (6)

  • Evolution of Data Sharding Towards Automation and Flexibility
    Like in many databases, Apache Doris shards data into partitions, and then a partition is further divided into buckets. Partitions are typically defined by time or other continuous values. This allows query engines to quickly locate the target data during queries by pruning irrelevant data ranges. - Source: dev.to / 9 months ago
  • Steps to industry-leading query speed: evolution of the Apache Doris execution engine
    What makes a modern database system? The three key modules are query optimizer, execution engine, and storage engine. Among them, the role of execution engine to the DBMS is like the chef to a restaurant. This article focuses on the execution engine of the Apache Doris data warehouse, explaining the secret to its high performance. - Source: dev.to / 9 months ago
  • Apache Doris for log and time series data analysis in NetEase, why not Elasticsearch and InfluxDB?
    For most people looking for a log management and analytics solution, Elasticsearch is the go-to choice. The same applies to InfluxDB for time series data analysis. These were exactly the choices of NetEase, one of the world's highest-yielding game companies but more than that. As NetEase expands its business horizons, the logs and time series data it receives explode, and problems like surging storage costs and... - Source: dev.to / 10 months ago
  • Multi-tenant workload isolation in Apache Doris: a better balance between isolation and utilization
    This is an in-depth introduction to the workload isolation capabilities of Apache Doris. But first of all, why and when do you need workload isolation? If you relate to any of the following situations, read on and you will end up with a solution. - Source: dev.to / 11 months ago
  • SQL Convertor for Easy Migration from Presto, Trino, ClickHouse, and Hive to Apache Doris
    Apache Doris is an all-in-one data platform that is capable of real-time reporting, ad-hoc queries, data lakehousing, log management and analysis, and batch data processing. As more and more companies have been replacing their component-heavy data architecture with Apache Doris, there is an increasing need for a more convenient data migration solution. That's why the Doris SQL Convertor is made. - Source: dev.to / 12 months ago
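
The partition-then-bucket sharding described in the first mention above can be sketched roughly as follows. This is an illustrative toy in plain Python, not Apache Doris code: the monthly partition scheme, the CRC32 hash, the bucket count, and all names (`NUM_BUCKETS`, `partition_key`, `bucket_id`, `shard_for`, `prune_partitions`) are assumptions made for the example.

```python
from datetime import date
from zlib import crc32

NUM_BUCKETS = 4  # hypothetical bucket count per partition


def partition_key(day: date) -> str:
    """Partition by a continuous value; here, one partition per calendar month."""
    return day.strftime("%Y-%m")


def bucket_id(user_id: str, num_buckets: int = NUM_BUCKETS) -> int:
    """Within a partition, spread rows across buckets by hashing a key column."""
    return crc32(user_id.encode("utf-8")) % num_buckets


def shard_for(day: date, user_id: str):
    """Return the (partition, bucket) pair a row would be stored in."""
    return partition_key(day), bucket_id(user_id)


def prune_partitions(partitions, start: date, end: date):
    """Keep only partitions that can contain rows in [start, end].

    'YYYY-MM' keys sort lexicographically in date order, so string
    comparison is enough to skip irrelevant ranges.
    """
    low, high = partition_key(start), partition_key(end)
    return [p for p in partitions if low <= p <= high]


if __name__ == "__main__":
    all_partitions = ["2024-01", "2024-02", "2024-03", "2024-04"]
    print(shard_for(date(2024, 2, 10), "user-42"))
    print(prune_partitions(all_partitions, date(2024, 2, 10), date(2024, 3, 5)))
```

A query restricted to a date range only needs to scan the partitions that survive pruning, which is the locating-by-range behavior the quoted article describes.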

InfluxData mentions (2)

  • Can i log data into excel/csv using aws?
    I would highly recommend using a proper Time Series Database like QuestDB or InfluxDB to do this instead. You can always export data from either of those two into Excel if your boss wants it in Excel, but it's much easier to do data transformations, create graphs and reports, etc. if you have all the data in a proper database. Source: over 3 years ago
  • How to stream IoT data into Excel
    I would suggest using something better suited to IoT data than ... a spreadsheet. I'd recommend looking at one of the Time Series Databases for this. 1) QuestDB or 2) InfluxDB as these are much better suited to streaming data. Source: over 3 years ago

What are some alternatives?

When comparing Apache Doris and InfluxData, you can also consider the following products

ClickHouse - ClickHouse is an open-source column-oriented database management system that allows generating analytical data reports in real time.

TimescaleDB - TimescaleDB is a time-series SQL database providing fast analytics, scalability, with automated data management on a proven storage engine.

StarRocks - StarRocks offers the next generation of real-time SQL engines for enterprise-scale analytics. Learn how we make it easy to deliver real-time analytics.

Prometheus - An open-source systems monitoring and alerting toolkit.

Apache Hive - Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage.

VictoriaMetrics - Fast, easy-to-use, and cost-effective time series database