Repository navigation

#

ml-pipelines

An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈

Jupyter Notebook
2707
3 个月前
Python
986
3 个月前

An AutoML pipeline selection system to quickly select a promising pipeline for a new dataset.

Python
82
3 年前

Free Open-source ML observability course for data scientists and ML engineers. Learn how to monitor and debug your ML models in production.

Jupyter Notebook
81
1 年前

Free and open source automation platform

Go
50
4 天前

Best practices for engineering ML pipelines.

Jupyter Notebook
35
3 年前

Library for streaming data and incremental learning algorithms.

Python
23
1 年前

Components that I have created for Kubeflow Pipelines. Try them in https://cloud-pipelines.net/pipeline-editor/

Python
14
2 年前

Serverless ML system to predict the direction and volume of electricity flows to and from the Netherlands and its energy transmission partners.

Python
11
13 天前

This Project is a part of Data Science Nanodegree Program by Udacity in collaboration with Figure Eight. The initial dataset contains pre-labelled tweet and messages from real-life disasters. The aim of this project is to build a Natural Language Processing tool that categorize messages.

Jupyter Notebook
9
5 年前

This a repo that was created to learn more about Airflow and develop awesome data engineering projects. 🚀🚀

Python
5
1 年前

Fraud detection ML pipeline and serving POC using Dask and hopeit.engine. Project created with nbdev: https://www.fast.ai/2019/12/02/nbdev/

Jupyter Notebook
5
2 年前

This repository contains my code solution to DeepLearning.AIs Practical Data Science On AWS Cloud Specialization.

Jupyter Notebook
2
2 年前

Big data application of Machine Learning concepts for sentiment classification of US Airlines tweets. The focus is on the usage of pyspark libraries (ml-lib) on big data to solve a problem using Machine Learning algorithms and not about the choice of algorithm used in the ML model creation. It also involves data pre-processing using NLP techniques, cross-validation and parameter-grid builder.

Jupyter Notebook
2
3 年前

ML pipeline to categorize emergency messages based on the needs communicated by the sender.

Jupyter Notebook
2
1 个月前

Develop algorithms to classify genetic mutations based on clinical evidence (text).

Jupyter Notebook
1
2 年前

In this project, I developed a completed Vertex and Kubeflow pipelines SDK to build and deploy an AutoML / BigQuery ML regression model for online predictions. Using this ML Pipeline, I was able to develop, deploy, and manage the production ML lifecycle efficiently and reliably.

Jupyter Notebook
1
2 年前

This project focuses on building end-to-end machine learning pipeline using AWS SageMaker to predict the price range of mobile phones based on their specifications, enhancing consumer decision-making and streamlining the development process.

Jupyter Notebook
0
1 年前

Example solution to the MLOps Case Study covering both online and batch processing.

Pkl
0
10 个月前