Repository navigation

#

apache-beam

TFX is an end-to-end platform for deploying production ML pipelines

Python
2157
2 个月前

[DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.

Go
658
3 年前

ETL scripts for Bitcoin, Litecoin, Dash, Zcash, Doge, Bitcoin Cash. Available in Google BigQuery https://goo.gl/oY5BCQ

Python
430
4 个月前

Tools to make weather data accessible and useful.

Python
231
7 天前

Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.

Go
215
14 小时前

A collection of tools for extracting FHIR resources and analytics services on top of that data.

Jupyter Notebook
187
4 天前

TFRecorder makes it easy to create TensorFlow records (TFRecords) from Pandas DataFrames and CSVs files containing images or structured data.

Python
180
3 年前

Clojure API for a more dynamic Google Dataflow

Clojure
130
2 个月前

Collection of transforms for the Apache beam python SDK.

Python
89
2 年前

Tool to define Apache Beam pipeline in YAML or JSON

Java
76
7 天前

Asgarde allows simplifying error handling with Apache Beam Java, with less code, more concise and expressive code.

Java
75
1 年前

Repository to quickly get you started with new Machine Learning projects on Google Cloud Platform. More info(slides):

Python
64
7 年前

Export a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow

Java
58
5 年前

Some class materials for a data processing course using PySpark

Python
52
3 年前

Opinionated serverless event analytics pipeline

Go
43
2 年前