model-inference

Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.

Python
11699
1 day ago

Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"

90
7 months ago

CLIP as a service: embed images and sentences for object recognition, visual reasoning, image classification, and reverse image search

Jupyter Notebook
64
22 days ago

Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & Heterogeneous Hardware Support

C++
51
3 months ago

EmbeddedLLM: API server for embedded device deployment. Currently supports CUDA/OpenVINO/IpexLLM/DirectML/CPU

Python
42
10 months ago

Accelerating AI Training and Inference from Storage Perspective (Must-read Papers on Storage for AI)

37
1 month ago

Streamlines the setup for running TensorFlow Lite models with PyCoral on an Edge TPU USB device.

Python
30
1 year ago

Image caption generation using various neural network architectures

Jupyter Notebook
18
2 years ago

Image classifiers are used in computer vision to identify the content of an image. They are applied across a broad variety of industries, from advanced technologies like autonomous vehicles and augmented reality to eCommerce platforms and even diagnostic medicine.

HTML
4
2 years ago

The primary objective of this project was to build and deploy an image classification model for Scones Unlimited, a scone-delivery-focused logistics company, using AWS SageMaker.

HTML
4
3 years ago

A fine-tuned BERT transformer model that classifies symptoms to their corresponding diseases with an accuracy of up to 89%.

Jupyter Notebook
2
1 year ago

A personal journey into model inference engineering — learning, building, and sharing along the way.

Jupyter Notebook
2
5 days ago

A cloud run function to invoke a prediction against a machine learning model that has been trained outside of a cloud provider.

Python
1
7 months ago

A fine-tuned pretrained DistilBERT transformer model that classifies social media text into one of four cyberbullying labels (ethnicity/race, gender/sexual, religion, and not cyberbullying) with a reported accuracy of 99%.

Jupyter Notebook
1
1 year ago

A fine-tuned DistilBERT transformer model that predicts the overall sentiment of a piece of financial news with an accuracy of nearly 81.5%.

Jupyter Notebook
1
1 year ago

Example distributed system for ML model inference using Kafka, including a Spring Boot REST + JPA server and a Java consumer program

Java
1
24 days ago

This project is a web-based application that uses a pre-trained Mask R-CNN model to detect and classify car damage types (scratch, dent, shatter, dislocation) from images. Users can upload an image of a car, and the application highlights damaged areas with bounding boxes and masks, giving a clear visual representation of the detected damage.

Jupyter Notebook
1
1 year ago