Repository navigation

model-compression

Website
Wikipedia

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

automl 深度学习 neural-architecture-search hyperparameter-optimization distributed bayesian-optimization automated-machine-learning 机器学习数据科学 Tensorflow PyTorch 神经网络深度神经网络 model-compression feature-engineering nas Python hyperparameter-tuning mlops

Python

14163

1819

10 个月前

huawei-noah / Efficient-AI-Backbones

Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

convolutional-neural-networks efficient-inference imagenet model-compression Tensorflow PyTorch ghostnet transformer pretrained-models vision-transformer

Python

4185

718

1 个月前

dkozlov / awesome-knowledge-distillation

Awesome Knowledge Distillation

knowledge-distillation teacher-student distillation model-compression 深度学习

3645

508

1 个月前

huawei-noah / Pretrained-Language-Model

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

knowledge-distillation model-compression quantization pretrained-models

Python

3081

635

1 年前

VainF / Torch-Pruning

[CVPR 2023] DepGraph: Towards Any Structural Pruning

pruning model-compression network-pruning channel-pruning efficient-deep-learning cvpr2023

Python

2978

346

7 天前

Tencent / PocketFlow

An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.

深度学习 model-compression mobile-app automl 机器视觉

Python

2864

492

2 年前

FLHonker / Awesome-Knowledge-Distillation

Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。

distillation 深度学习 transfer-learning model-compression

2571

337

2 年前

he-y / Awesome-Pruning

A curated list of neural network pruning resources.

pruning model-compression Awesome Lists

2437

330

1 年前

666DZY666 / micronet

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape

quantization pruning dorefa twn bnn xnor-net PyTorch model-compression group-convolution convolutional-networks quantization-aware-training post-training-quantization tensorrt onnx

Python

2240

476

3 天前

Efficient-ML / Awesome-Model-Quantization

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.

深度学习 quantization Awesome Lists model-compression efficient-deep-learning model-quantization

2062

220

2 个月前

haitongli / knowledge-distillation-pytorch

A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility

PyTorch knowledge-distillation 深度神经网络 cifar10 model-compression 机器视觉

Python

1926

350

2 年前

AberHu / Knowledge-Distillation-Zoo

Pytorch implementation of various Knowledge Distillation (KD) methods.

knowledge-distillation teacher-student model-compression distillation

Python

1682

268

3 年前

tensorflow / model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

Tensorflow 机器学习深度学习 optimization Keras model-compression compression pruning sparsity quantization

Python

1531

325

2 个月前

microsoft / NeuronBlocks

NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego

question-answering 深度学习 PyTorch 自然语言处理 text-classification 人工智能 dnn qna text-matching knowledge-distillation model-compression sequence-labeling

Python

1454

195

2 年前

huawei-noah / Efficient-Computing

Efficient computing methods developed by Huawei Noah's Ark Lab

knowledge-distillation model-compression binary-neural-networks pruning quantization self-supervised

Jupyter Notebook

1261

217

5 个月前

ethanhe42 / channel-pruning

Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)

image-recognition model-compression acceleration object-detection image-classification channel-pruning 深度神经网络

Python

1083

311

1 年前

MingSun-Tse / Efficient-Deep-Learning

Collection of recent methods on (deep) neural network compression and acceleration.

model-compression network-pruning knowledge-distillation 深度学习深度神经网络 efficient-deep-learning

945

131

15 天前

horseee / DeepCache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

diffusion-models efficient-inference model-compression stable-diffusion

Python

886

10 个月前

guan-yuan / awesome-AutoML-and-Lightweight-Models

A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures, 3.) Model Compression, Quantization and Acceleration, 4.) Hyperparameter Optimization, 5.) Automated Feature Engineering.

automl meta-learning automated-feature-engineering hyperparameter-optimization model-compression Awesome Lists neural-architecture-search nas PyTorch quantization Tensorflow

853

160

4 年前

alibaba / TinyNeuralNetwork

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

PyTorch 深度学习 model-compression pruning model-converter quantization-aware-training 深度神经网络 post-training-quantization

Python

823

125

1 个月前