Software Alternatives & Reviews
Table of contents
  1. Social Mentions
  2. Comments

Apache Beam

Apache Beam provides an advanced unified programming model to implement batch and streaming data processing jobs.

Categories: Big Data, Data Dashboard, Big Data Tools

Apache Beam Reviews and details

Apache Beam Landing Page

Social recommendations and mentions

We have tracked the following product recommendations or mentions on Reddit, HackerNews and some other sources. They can help you see what people think about Apache Beam and what they use it for.
  • Real Time Data Infra Stack
    Apache Beam: Streaming framework which can be run on several runners, such as Apache Flink and GCP Dataflow. - Source: dev.to / 6 months ago
  • Google Cloud Reference
    Apache Beam: Batch/streaming data processing. - Source: dev.to / 9 months ago
  • Composer out of resources - "INFO Task exited with return code Negsignal.SIGKILL"
    What you are looking for is Dataflow. It can be a bit tricky to wrap your head around at first, but I highly suggest leaning into this technology for most of your data engineering needs. It's based on the open source Apache Beam framework that originated at Google. We use an internal version of this system at Google for virtually all of our pipeline tasks, from a few GB, to Exabyte scale systems -- it can do it all. - Source: Reddit / 10 months ago
  • Pub/Sub parallel processing best practices
    That being said, there is a learning curve in understanding how Apache Beam works. Take a look at the beam website for more information. - Source: Reddit / 10 months ago
  • Data engineering in GCP is not matured
    Take a look at Apache Beam as it's the basis for the Dataflow service. - Source: Reddit / 10 months ago
  • GCP to AWS
    Apache Beam is a framework in which we can implement batch and streaming data processing pipelines independent of the underlying engine, e.g. Spark, Flink, Dataflow, etc. - Source: Reddit / over 1 year ago
  • Jinja2 not formatting my text correctly. Any advice?
    ListItem(name='Apache Beam', website='https://beam.apache.org/', category='Batch Processing', short_description='Apache Beam is an open source unified programming model to define and execute data processing pipelines, including ETL, batch and stream processing'),. - Source: Reddit / over 1 year ago
  • Frameworks of the Future?
    I asked a similar question in a different community, and the closest they came up with was the niche Apache Beam and the obligatory vague hand-waving about no-code systems. So, maybe DEV seeming to skew younger and more deliberately technical might get a better view of things? Is anybody using a "Framework of the Future" that we should know about? - Source: dev.to / almost 2 years ago
  • Best library for CSV to XML or JSON.
    Apache Beam may be what you're looking for. It will work with both Python and Java. It's used by GCP in the Cloud Dataflow service as a sort of streaming ETL tool. It occupies a similar niche to Spark, but is a little easier to use IMO. - Source: Reddit / almost 2 years ago
  • How to guarantee exactly once with Beam(on Flink) for side effects
    Now that we understand how exactly-once state consistency works, you might wonder about side effects, such as sending out an email or writing to a database. That is a valid concern, because Flink's recovery mechanisms are not sufficient to provide end-to-end exactly-once guarantees even though the application state is exactly-once consistent, for example, if messages x and y from above contain info and action to... - Source: dev.to / about 2 years ago
  • Best Practices to Become a Data Engineer
    Apache Beam - Apache Beam is a scalable framework that allows you to implement batch and streaming data processing jobs. It is a framework that you can use in order to create a data pipeline on Google Cloud or on Amazon Web Services. - Source: dev.to / about 2 years ago
  • Ecosystem: Haskell vs JVM (Eta, Frege)
    Dataflow is Google's implementation of a runner for Apache Beam jobs in Google Cloud. Right now, Python and Java are pretty much the only two options supported for writing Beam jobs that run on Dataflow. - Source: Reddit / about 2 years ago
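
One of the mentions above raises the question of side effects (sending an email, writing to a database) when a runner may replay messages after recovery. A common way to handle this, which is not taken from the quoted post, is to make the side effect idempotent by keying it on a message ID. The sketch below uses plain Python only, with no Beam or Flink calls. The class IdempotentEmailSink, the function replay_with_failure, and the in-memory set standing in for a durable store are all hypothetical names chosen for illustration.

```python
"""Sketch: idempotent side effects under redelivery (no Beam or Flink required)."""


class IdempotentEmailSink:
    """Hypothetical sink that performs each side effect at most once per message ID."""

    def __init__(self):
        self.sent_ids = set()  # assumption: stand-in for a durable store of handled IDs
        self.outbox = []       # assumption: stand-in for the real side effect (an email)

    def send(self, message_id, body):
        # Skip the side effect if this message ID was already handled.
        if message_id in self.sent_ids:
            return False
        self.outbox.append(body)
        self.sent_ids.add(message_id)
        return True


def replay_with_failure(sink, messages):
    """Deliver every message, then deliver all of them again, as could happen after a restart."""
    for message_id, body in messages:
        sink.send(message_id, body)
    for message_id, body in messages:
        sink.send(message_id, body)


sink = IdempotentEmailSink()
replay_with_failure(sink, [("x", "hello x"), ("y", "hello y")])
print(sink.outbox)  # prints ['hello x', 'hello y']
```

In a real pipeline the record of handled IDs would have to survive restarts and be updated together with the side effect, or the external system would need to accept the ID as a deduplication key. This sketch leaves both out.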


Generic Apache Beam discussion


This is an informative page about Apache Beam. You can review and discuss the product here. The primary details have not been verified within the last quarter, and they might be outdated. If you think we are missing something, please use the means on this page to comment or suggest changes. All reviews and comments are highly encouraged and appreciated, as they help everyone in the community make an informed choice. Please always be kind and objective when evaluating a product and sharing your opinion.