Software Alternatives, Accelerators & Startups

HortonWorks Data Platform VS InfluxData

Compare HortonWorks Data Platform VS InfluxData and see what are their differences

HortonWorks Data Platform logo HortonWorks Data Platform

The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly...

InfluxData logo InfluxData

Scalable datastore for metrics, events, and real-time analytics.
  • HortonWorks Data Platform Landing page
    Landing page //
    2023-09-28
  • InfluxData Landing page
    Landing page //
    2023-07-30

HortonWorks Data Platform features and specs

  • Open Source Foundation
    HortonWorks Data Platform (HDP) is built entirely on open-source technologies, allowing for greater community support, flexibility, and transparency in its development and deployment.
  • Enterprise-Grade Security
    HDP offers robust security features, including authentication, authorization, auditing, and data protection, which are critical for managing sensitive data in enterprise environments.
  • Scalability
    The platform can handle large volumes of data, making it suitable for enterprises that require scalable solutions to manage their big data demands.
  • Comprehensive Ecosystem
    HortonWorks provides a comprehensive suite of tools and integrations, including Apache Hadoop, Hive, HBase, and others, enabling diverse data processing and analytics capabilities.

Possible disadvantages of HortonWorks Data Platform

  • Complexity
    The platform's extensive set of features and integrations can be complex to configure and manage, especially for organizations without dedicated data engineering teams.
  • Resource Intensiveness
    Running HDP can be resource-intensive, requiring significant hardware and infrastructure investments, which might be a barrier for smaller organizations.
  • Learning Curve
    Due to its complexity and the breadth of technologies involved, there is a steep learning curve for new users or teams unfamiliar with the Hadoop ecosystem.
  • Support and Documentation
    While there is community support available due to its open-source nature, some users might find official support and comprehensive documentation lacking compared to proprietary solutions.

InfluxData features and specs

  • High Performance
    InfluxData's InfluxDB is designed to handle high write and query loads, making it suitable for time-series data and real-time applications.
  • Open-Source
    The core InfluxDB product is open-source, allowing for transparency, community contributions, and the option to self-host the database.
  • Scalability
    InfluxDB offers horizontal scalability, enabling users to handle increasing volumes of data efficiently through clustering.
  • Built-In Data Processing
    InfluxData offers integrated tools for data processing and scripting, such as Kapacitor for real-time processing and Flux for advanced querying.
  • Rich Ecosystem
    InfluxData provides a comprehensive ecosystem including Telegraf for data collection, Chronograf for visualization, and Kapacitor for alerting and processing.
  • Time-Series Focused
    InfluxDB is optimized for time-series data, offering specialized features like time-based retention policies, continuous queries, and downsampling.
  • Easy Integration
    InfluxDB integrates well with many third-party data visualization and monitoring tools such as Grafana, making it easier to build end-to-end solutions.

Possible disadvantages of InfluxData

  • Complexity
    The comprehensive features and tools in the InfluxData ecosystem can result in a steeper learning curve, especially for novices.
  • Cost
    While the open-source version is free, the enterprise and cloud-hosted versions come with a cost, which can be significant for small to mid-sized businesses.
  • Resource Intensive
    InfluxDB can be resource-intensive, especially under high loads, requiring significant hardware resources for optimal performance.
  • Limited SQL Support
    InfluxDB doesn’t fully support SQL, which can be a hurdle for users accustomed to traditional relational databases. It uses its own query languages like InfluxQL and Flux.
  • Fragmented Documentation
    Some users find the documentation fragmented or lacking in depth, which can make troubleshooting and advanced usage more challenging.
  • Data Backup and Restore
    Managing backups and restores in InfluxDB can be intricate and may require additional effort and tools to ensure data integrity and availability.

HortonWorks Data Platform videos

Why You Need Hortonworks Data Platform 3.0

More videos:

  • Review - Hortonworks Data Platform 3.0 – Faster, Smarter, Hybrid Data

InfluxData videos

Barbara Nelson [InfluxData] | Best Practices for Data Ingestion into InfluxDB

Category Popularity

0-100% (relative to HortonWorks Data Platform and InfluxData)
Data Dashboard
100 100%
0% 0
Databases
0 0%
100% 100
Big Data
58 58%
42% 42
Time Series Database
0 0%
100% 100

User comments

Share your experience with using HortonWorks Data Platform and InfluxData. For example, how are they different and which one is better?
Log in or Post with

Reviews

These are some of the external sources and on-site user reviews we've used to compare HortonWorks Data Platform and InfluxData

HortonWorks Data Platform Reviews

We have no reviews of HortonWorks Data Platform yet.
Be the first one to post

InfluxData Reviews

ReductStore vs. MinIO & InfluxDB on LTE Network: Who Really Wins the Speed Race?
Maintaining consistency between multiple databases, like MinIO and InfluxDB, adds a layer of complexity. In our setup, MinIO, used for blob storage, is linked to data points in InfluxDB via its filename. Any inconsistencies or mismatches between the two could potentially result in data loss. Furthermore, we need to query both databases, which is quite inefficient. Lastly,...
Apache Druid vs. Time-Series Databases
We occasionally get questions regarding how Apache Druid differs from time-series databases (TSDB) such as InfluxDB or Prometheus, and when to use each technology. This short post serves to help answer these questions.
Source: imply.io
4 Best Time Series Databases To Watch in 2019
InfluxDB is part of the TICK stack : Telegraf, InfluxDB, Chronograf and Kapacitor. InfluxData provides, out of the box, a visualization tool (that can be compared to Grafana), a data processing engine that binds directly with InfluxDB, and a set of more than 50+ agents that can collect real-time metrics for a lot of different data sources.
Source: medium.com

Social recommendations and mentions

Based on our record, InfluxData should be more popular than HortonWorks Data Platform. It has been mentiond 2 times since March 2021. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.

HortonWorks Data Platform mentions (1)

InfluxData mentions (2)

  • Can i log data into excel/csv using aws?
    I would highly recommend using a proper Time Series Database like QuestDB or InfluxDB to do this instead. You can always export data from wither of those two into Excel if your boss wants it in excel, but it's much easier to do data transformations, create graphs and reports, etc. If you have all the data in a proper database. Source: about 3 years ago
  • How to stream IoT data into Excel
    I would suggest using something better suited to IoT data than ... a spreadsheet. I'd recommend looking at one of the Time Series Databases for this. 1) QuestDB or 2) InfluxDB as these are much better suited to streaming data. Source: over 3 years ago

What are some alternatives?

When comparing HortonWorks Data Platform and InfluxData, you can also consider the following products

Amazon EMR - Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.

TimescaleDB - TimescaleDB is a time-series SQL database providing fast analytics, scalability, with automated data management on a proven storage engine.

Google Cloud Dataproc - Managed Apache Spark and Apache Hadoop service which is fast, easy to use, and low cost

Prometheus - An open-source systems monitoring and alerting toolkit.

Google BigQuery - A fully managed data warehouse for large-scale data analytics.

MongoDB - MongoDB (from "humongous") is a scalable, high-performance NoSQL database.