Repository navigation

#

ml-pipelines

An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈

Jupyter Notebook
2762
9 个月前
Python
996
9 个月前

Free Open-source ML observability course for data scientists and ML engineers. Learn how to monitor and debug your ML models in production.

Jupyter Notebook
95
2 年前

An AutoML pipeline selection system to quickly select a promising pipeline for a new dataset.

Python
84
4 年前

Free and open source automation platform

Go
49
6 天前

Best practices for engineering ML pipelines.

Jupyter Notebook
36
3 年前

Library for streaming data and incremental learning algorithms.

Python
25
17 天前

Components that I have created for Kubeflow Pipelines. Try them in https://cloud-pipelines.net/pipeline-editor/

Python
14
20 天前

Serverless ML system to predict the direction and volume of electricity flows to and from the Netherlands and its energy transmission partners.

Python
11
6 个月前

This Project is a part of Data Science Nanodegree Program by Udacity in collaboration with Figure Eight. The initial dataset contains pre-labelled tweet and messages from real-life disasters. The aim of this project is to build a Natural Language Processing tool that categorize messages.

Jupyter Notebook
8
5 年前

A collection of real-world machine learning and AI projects. Explore hands-on implementations of cutting-edge models, practical solutions, and techniques to tackle real-world challenges using AI.

Jupyter Notebook
5
2 个月前

This a repo that was created to learn more about Airflow and develop awesome data engineering projects. 🚀🚀

Python
5
2 年前

Fraud detection ML pipeline and serving POC using Dask and hopeit.engine. Project created with nbdev: https://www.fast.ai/2019/12/02/nbdev/

Jupyter Notebook
5
2 年前

🧠A hands-on workspace for practicing machine learning concepts, data preprocessing, and experimenting with small ML projects. This repo includes foundational Python scripts, real-world mini-projects, and experiments that reflect a progressive learning journey in applied machine learning.

Jupyter Notebook
3
2 个月前

This repository contains my code solution to DeepLearning.AIs Practical Data Science On AWS Cloud Specialization.

Jupyter Notebook
3
2 年前

Big data application of Machine Learning concepts for sentiment classification of US Airlines tweets. The focus is on the usage of pyspark libraries (ml-lib) on big data to solve a problem using Machine Learning algorithms and not about the choice of algorithm used in the ML model creation. It also involves data pre-processing using NLP techniques, cross-validation and parameter-grid builder.

Jupyter Notebook
2
3 年前

ML pipeline to categorize emergency messages based on the needs communicated by the sender.

Jupyter Notebook
2
1 个月前

Develop algorithms to classify genetic mutations based on clinical evidence (text).

Jupyter Notebook
1
2 年前

Guide on how to structure and implement machine learning pipelines.

Python
1
1 年前