Repository navigation

#

apache-beam

TFX is an end-to-end platform for deploying production ML pipelines

Python
2142
25 天前

[DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.

Go
658
3 年前

ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ

Python
421
2 个月前

Tools to make weather data accessible and useful.

Python
224
4 天前

Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.

Go
211
13 天前

TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSVs files containing images or structured data.

Python
183
3 年前

A collection of tools for extracting FHIR resources and analytics services on top of that data.

Java
176
6 天前

Clojure API for a more dynamic Google Dataflow

Clojure
131
3 天前

Collection of transforms for the Apache beam python SDK.

Python
89
1 年前

Asgarde allows simplifying error handling with Apache Beam Java, with less code, more concise and expressive code.

Java
74
9 个月前

Mercari Dataflow Template

Java
72
9 天前

Repository to quickly get you started with new Machine Learning projects on Google Cloud Platform. More info(slides):

Python
64
6 年前

Export a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow

Java
58
5 年前

Some class materials for a data processing course using PySpark

Python
52
2 年前

Opinionated serverless event analytics pipeline

Go
43
2 年前