#offloading

Running large language models on a single GPU for throughput-oriented scenarios.

Python
9305
6 months ago
Python
1970
11 hours ago

A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning (DRL) for Mobile Edge Computing (MEC) | This algorithm captures the dynamics of the MEC environment by integrating the Dueling Double Deep Q-Network (D3QN) model with Long Short-Term Memory (LSTM) networks.

Python
212
14 days ago

LLM Inference on consumer devices

Python
105
1 month ago

DPDK infrastructure for software acceleration. Currently working on RX and ACL pre-filter.

C
91
4 years ago

ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory

Python
89
14 days ago

DPU-Powered File System Virtualization over virtio-fs

Jupyter Notebook
69
1 year ago

A collection of tests for the Open vSwitch HW offload.

Shell
40
5 months ago

A Dynamic Programming Offloading Algorithm for Mobile Cloud Computing

MATLAB
36
6 years ago

LeapIO: Efficient and Portable Virtual NVMe Storage on ARM SoCs (ASPLOS'20)

C
28
4 years ago

A framework for IoT devices to offload tasks to the cloud, resulting in efficient computation and decreased cloud costs.

Python
28
3 years ago

A lightweight framework that enables serverless users to reduce their bills by harvesting non-serverless compute resources such as their VMs, on-premise servers, or personal computers.

Python
28
8 months ago

Monero hardware wallet protocol implementation for Trezor, agent

Python
26
3 years ago

The container-based cloud platform for mobile code offloading

Makefile
20
8 years ago

Code for paper "Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI" (MobiCom'22)

Python
17
2 years ago

Implementation of the RTSS'23 Best Student Paper Award paper "Progressive Neural Compression for Adaptive Image Offloading under Timing Constraints"

PureBasic
11
25 days ago