blog post

In this article I will walk you through the processing of some Dutch COVID-19 data using Google Dataflow and Apache Beam via Spotify’s Scio Scala library and a dash of Twitter’s Algebird. (Bake at 200 degrees for 20 minutes)

Why this combo? Because I wanted to learn more about Dataflow, from a quick comparison I much prefer the Scala API over the Java or Python API for Beam and Covid numbers are of course a very current topic and relatable for many people.

Read more…

Related Articles

blog-post

CD4ML: Ability to reproduce

Introduction Barry is a data scientist working for a small data science and engineer consulting company based in …