auto-tuning

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

Python
2503
5 days ago

bpftune uses BPF to auto-tune Linux systems

C
1676
2 months ago

Must-read research papers, plus links to tools and datasets, related to using machine learning for compilers and systems optimisation

1606
23 days ago

Machine Learning Framework for Operating Systems - Brings ML to Linux kernel

C
249
4 years ago

CLTune: An automatic OpenCL & CUDA kernel tuner

C++
182
3 years ago

Python
110
3 months ago

Benchmark scripts for TVM

Python
74
4 years ago

Collective Knowledge crowd-tuning extension that lets users crowdsource experiments (using portable Collective Knowledge workflows), such as performance benchmarking, auto-tuning, and machine learning, across diverse volunteer-provided platforms running Linux, Windows, macOS, and Android. Demo of DNN crowd-benchmarking and crowd-tuning:

Python
36
4 years ago

K2vTune (A Workload-aware Configuration Tuning for RocksDB)

Jupyter Notebook
29
2 years ago

A Generic Distributed Auto-Tuning Infrastructure

Python
22
4 years ago

A GPU benchmark suite for autotuners

Cuda
19
2 years ago

Aggregated parallel CLI agents that perform fully automatic optimization of HPC code

Python
19
3 days ago

Backoff uses an exponential backoff algorithm to back off between retries, with optional auto-tuning functionality.

Go
13
8 years ago

This software package accompanies the paper "A Methodology for Comparing Auto-Tuning Optimization Algorithms" (https://doi.org/10.1016/j.future.2024.05.021), making the guidelines in the methodology easy to apply.

Python
6
1 month ago