Repository navigation

#

serving

A flexible, high-performance serving system for machine learning models

C++
6267
7 小时前
Go
4588
1 天前

An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models

HTML
4503
16 小时前

In this repository, I will share some useful notes and references about deploying deep learning-based models in production.

4332
5 个月前
Java
4313
3 天前

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.

C++
3158
2 个月前

Deploy high-performance AI models and inference pipelines on FastAPI with built-in batching, streaming and more.

Python
3062
9 天前
ray-project/llm-applications

A comprehensive guide to building RAG-based LLM applications for production.

Jupyter Notebook
1785
9 个月前

DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/

Python
1590
3 天前

A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.

Java
1397
1 天前

A flexible, high-performance carrier for machine learning models(『飞桨』服务化部署框架)

C++
905
11 天前

Generic and easy-to-use serving service for machine learning models

JavaScript
758
1 个月前
C++
722
14 小时前

A unified end-to-end machine intelligence platform

Python
533
8 个月前

부스트캠프 AI Tech - Product Serving 자료

Python
454
2 个月前