Spring Cloud Data Flow might be a bit more popular than HortonWorks Data Platform. We know about 1 link to it since March 2021 and only 1 link to HortonWorks Data Platform. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
A copy of Hadoop installed on each of these machines. You can download Hadoop from the Apache website, or you can use a distribution like Cloudera or Hortonworks. - Source: dev.to / over 2 years ago
And a Cloudera project: https://www.cloudera.com/products/cdf.html And an Azure feature: https://docs.microsoft.com/en-us/azure/data-factory/control-flow-execute-data-flow-activity And a Spring feature: https://spring.io/projects/spring-cloud-dataflow. - Source: Hacker News / over 4 years ago
Amazon EMR - Amazon Elastic MapReduce is a web service that makes it easy to quickly process vast amounts of data.
Google Cloud Dataflow - Google Cloud Dataflow is a fully-managed cloud service and programming model for batch and streaming big data processing.
Google Cloud Dataproc - Managed Apache Spark and Apache Hadoop service which is fast, easy to use, and low cost
Confluent - Confluent offers a real-time data platform built around Apache Kafka.
Google BigQuery - A fully managed data warehouse for large-scale data analytics.
Amazon Kinesis - Amazon Kinesis services make it easy to work with real-time streaming data in the AWS cloud.