Repository navigation

#

inference-engine

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.

Python
3840
1 个月前

校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step

C++
2862
6 个月前
siliconflow/onediff
Jupyter Notebook
1870
3 个月前

FeatherCNN is a high performance inference engine for convolutional neural networks.

C++
1217
6 年前

Paddle.js is a web project for Baidu PaddlePaddle, which is an open source deep learning framework running in the browser. Paddle.js can either load a pre-trained model, or transforming a model from paddle-hub with model transforming tools provided by Paddle.js. It could run in every browser with WebGL/WebGPU/WebAssembly supported. It could also run in Baidu Smartprogram and WX miniprogram.

JavaScript
1030
1 年前

A highly optimized LLM inference acceleration engine for Llama and its variants.

C++
885
5 天前
C++
798
1 年前

🔥 (yolov3 yolov4 yolov5 unet ...)A mini pytorch inference framework which inspired from darknet.

C++
744
2 年前

The Qualcomm® AI Hub Models are a collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

Python
669
10 天前

A library for high performance deep learning inference on NVIDIA GPUs.

C++
552
3 年前

A common base representation of python source code for pylint and other projects

Python
543
5 天前

High performance Cross-platform Inference-engine, you could run Anakin on x86-cpu,arm, nv-gpu, amd-gpu,bitmain and cambricon devices.

C++
533
3 年前

A Machine Learning System for Data Enrichment.

Python
520
2 年前

A rule engine written in Ruby.

Ruby
487
1 年前

PyKnow: Expert Systems for Python

Python
477
5 年前

Julia package for automated Bayesian inference on a factor graph with reactive message passing

Jupyter Notebook
331
4 天前

校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。

C++
330
17 天前