Repository navigation

#

model-inference

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Python
11153
3 天前

Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"

80
3 个月前

CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search

Jupyter Notebook
60
1 年前

Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"

59
3 个月前

Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & Heterogeneous Hardware Support

C++
38
2 小时前

EmbeddedLLM: API server for Embedded Device Deployment. Currently support CUDA/OpenVINO/IpexLLM/DirectML/CPU

Python
37
6 个月前

Streamlining the process for seamless execution of PyCoral in running TensorFlow Lite models on an Edge TPU USB.

Python
29
1 年前

Build self-hosted RAG AI Agents powered by open-source LLMs, use LLM models from Ollama and Huggingface, add external API calls, python and shell scripts for context-aware LLM interactions, add validation checks, and build Bring Your Own Infrastructure (BYOI) Dockerized AI Agent images.

Go
20
20 分钟前

Генерация описаний к изображениям с помощью различных архитектур нейронных сетей

Jupyter Notebook
17
2 年前

Accelerating AI Training and Inference from Storage Perspective (Must-read Papers on Storage for AI)

14
17 天前

Image Classifiers are used in the field of computer vision to identify the content of an image and it is used across a broad variety of industries, from advanced technologies like autonomous vehicles and augmented reality, to eCommerce platforms, and even in diagnostic medicine.

HTML
4
2 年前

The primary objective of this project was to build and deploy an image classification model for Scones Unlimited, a scone-delivery-focused logistic company, using AWS SageMaker.

HTML
4
2 年前

Successfully fine-tuned a pretrained DistilBERT transformer model that can classify social media text data into one of 4 cyberbullying labels i.e. ethnicity/race, gender/sexual, religion and not cyberbullying with a remarkable accuracy of 99%.

Jupyter Notebook
1
10 个月前

A cloud run function to invoke a prediction against a machine learning model that has been trained outside of a cloud provider.

Python
1
3 个月前

Successfully developed a fine-tuned DistilBERT transformer model which can accurately predict the overall sentiment of a piece of financial news up to an accuracy of nearly 81.5%.

Jupyter Notebook
1
1 年前

Example distributed system for ML model inference by using Kafka, including spring boot REST+JPA server with Java consumer program

Java
1
7 个月前

This project is a web-based application that uses a pre-trained Mask R-CNN model to detect and classify car damage types (scratch, dent, shatter, dislocation) from images. Users can upload an image of a car, and the application will highlight damaged areas with bounding boxes and masks, providing a clear visual representation of the detected damage

Jupyter Notebook
1
8 个月前

The primary objective of this project was to build and deploy an image classification model for Scones Unlimited, a scone-delivery-focused logistic company, using AWS SageMaker.

HTML
0
2 年前

Successfully established an image classification model using PyTorch to classify the images of several distinct natural sceneries such as mountains, glaciers, forests, seas, streets and buildings with an accuracy of 86%.

Jupyter Notebook
0
4 个月前