Platform Engineering

We regularly write about our technical experiences (good and bad) and what we're learning from the market.

    Tags

    Write and deploy an Apache Beam pipeline with Dataflow

    Posted by Sheng Wu on 02 April 2019

    tech, gcp, dataflow, Apache Beam, Fast-Data, parquet, csv

    Overview 

    Apache Beam is a unified programming model and the name Beam means Batch + strEAM. It is good at processing both batch and streaming data and can be run on different runners, such as Google Dataflow, Apache Spark, and Apache Flink. The Beam programming guide documents on how to develop a pipeline and the ...

    Continue reading