Repository navigation
apache-beam
- Website
- Wikipedia
TFX is an end-to-end platform for deploying production ML pipelines
Cloud Dataflow Google-provided templates for solving in-Cloud data tasks
Yet Another UserAgent Analyzer
[DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ
Tools to make weather data accessible and useful.
Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSVs files containing images or structured data.
A collection of tools for extracting FHIR resources and analytics services on top of that data.
Clojure API for a more dynamic Google Dataflow
Collection of transforms for the Apache beam python SDK.
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Asgarde allows simplifying error handling with Apache Beam Java, with less code, more concise and expressive code.
Repository to quickly get you started with new Machine Learning projects on Google Cloud Platform. More info(slides):
Export a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow
Microservices in Post-Kubernetes Era. A polyglot monorepo
Some class materials for a data processing course using PySpark
Opinionated serverless event analytics pipeline