Repository navigation

serving

Website
Wikipedia

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python

36627

6229

1 小时前

tensorflow / serving

A flexible, high-performance serving system for machine learning models

机器学习深度学习深度神经网络 Python C++神经网络 serving Tensorflow

C++

6267

2199

7 小时前

vespa-engine / vespa

AI + Data, online. https://vespa.ai

vespa search-engine big-data 人工智能 serving serving-recommendation 机器学习 Server Tensorflow Java C++vector-search

Java

6121

630

18 分钟前

volcano-sh / volcano

A Cloud Native Batch System (Project under CNCF)

batch-systems Kubernetes Go hpc bigdata 机器学习 gene 人工智能 serving training

4588

1052

1 天前

SeldonIO / seldon-core

An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models

Kubernetes 机器学习部署 serving mlops aiops machine-learning-operations production-machine-learning

HTML

4503

836

16 小时前

ahkarami / Deep-Learning-in-Production

In this repository, I will share some useful notes and references about deploying deep learning-based models in production.

深度学习深度神经网络 Python PyTorch tesnorflow Keras mxnet caffe2 production serving C++model-serving 教程 Flask REST API React Angular Tensorflow

4332

691

5 个月前

pytorch / serve

Serve, optimize and scale PyTorch models in production

PyTorch 机器学习 mlops serving Docker Kubernetes optimization cpu gpu 监控深度学习

Java

4313

880

3 天前

PaddlePaddle / FastDeploy

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.

Android jetson tensorrt onnxruntime object-detection yolov5 openvino picodet uie graphcore intel kunlun rockchip serving onnx stable-diffusion yolov8

C++

3158

476

2 个月前

Lightning-AI / LitServe

Deploy high-performance AI models and inference pipelines on FastAPI with built-in batching, streaming and more.

人工智能 API serving 深度学习 developer-tools FastAPI REST API Web

Python

3062

199

9 天前

georgia-tech-db / evadb

Database system for AI-powered apps

eva video-analytics serving 数据库 labeling object-detection 数据分析人工智能 ChatGPT langchain auto-gpt gpt4all huggingface 大语言模型 gpt-4 agent Hacktoberfest

Python

2665

267

1 年前

tobegit3hub / tensorflow_template_application

TensorFlow template application for deep learning

Tensorflow tfrecords libsvm CSV 深度学习机器学习 mlp cnn lstm inference tensorboard serving

Python

1878

714

2 年前

ray-project / llm-applications

A comprehensive guide to building RAG-based LLM applications for production.

llms 机器学习 ray anyscale fine-tuning llama2 openai serving

Jupyter Notebook

1785

250

9 个月前

Delta-ML / delta

DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/

自然语言处理深度学习 Tensorflow speech sequence-to-sequence seq2seq speech-recognition text-classification speaker-verification nlu text-generation emotion-recognition tensorflow-lite inference asr serving front-end ops

Python

1590

289

3 天前

dingodb / dingo

A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency and ultra-low latency.

serving embedding-store vector-database mysql-compatibility embedding-search key-value-distributed-store vector-ocean unified-sql structured-data unstructured-data

Java

1397

241

1 天前

PaddlePaddle / Serving

A flexible, high-performance carrier for machine learning models（『飞桨』服务化部署框架）

rpc-service gpu Python Docker serving pipeline paddle 深度学习 prediction predictor dag micro-service

C++

905

251

11 天前