Repository navigation

#

inference

Python
59422
4 小时前
ggml-org/whisper.cpp
C++
43624
3 天前

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python
40299
7 小时前
C++
22128
2 天前
Python
18600
1 小时前
gvergnaud/ts-pattern

🎨 The exhaustive Pattern Matching library for TypeScript, with smart type inference.

TypeScript
14297
13 天前

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++
12204
10 天前

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

Jupyter Notebook
10753
4 天前

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python
9841
1 天前

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Python
8595
4 天前

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Python
8502
5 天前

💎1MB lightweight face detection model (1MB轻量级人脸检测模型)

Python
7435
2 年前