An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyperparameter tuning.
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.
An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
micronet: a model compression and deployment library. Compression: (1) quantization: quantization-aware training (QAT), both high-bit (>2b) (DoReFa; Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference) and low-bit (≤2b)/ternary and binary (TWN/BNN/XNOR-Net); post-training quantization (PTQ), 8-bit (TensorRT); (2) pruning: normal, reg…
A flexible PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments.
PyTorch implementation of various knowledge distillation (KD) methods.
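Several entries on this page cover knowledge distillation. As a minimal sketch (not taken from any of the listed libraries; function names are my own), the classic soft-target distillation loss compares temperature-softened teacher and student distributions with a KL divergence, scaled by T²:

```python
import numpy as np

def softmax(z, T=1.0):
    # Temperature-scaled softmax; higher T softens the distribution.
    z = np.asarray(z, dtype=float) / T
    z = z - z.max(axis=-1, keepdims=True)  # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, T=4.0):
    # KL(teacher || student) on temperature-softened distributions,
    # scaled by T^2 so gradients keep a consistent magnitude across T.
    p = softmax(teacher_logits, T)
    q = softmax(student_logits, T)
    return float(T * T * np.sum(p * (np.log(p) - np.log(q))))
```

In practice this term is usually mixed with the ordinary cross-entropy on hard labels; the loss is zero when student and teacher logits agree.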
A toolkit to optimize Keras and TensorFlow ML models for deployment, including quantization and pruning.
NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego
Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
A PyTorch library for benchmarking and extending work in knowledge distillation, pruning, and quantization.
Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration (CVPR 2019 Oral)
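The geometric-median criterion above ranks filters by redundancy rather than magnitude: a filter whose total distance to all other filters in the layer is smallest lies closest to their geometric median and is the most replaceable. A rough illustrative sketch of that selection step (my own simplification, not the paper's code) could look like:

```python
import numpy as np

def gm_prune_indices(filters, n_prune):
    # filters: (n_filters, k) array, each row a flattened conv filter.
    # Score each filter by its summed Euclidean distance to all others;
    # the smallest scores are closest to the geometric median -> prune first.
    F = np.asarray(filters, dtype=float)
    d = np.linalg.norm(F[:, None, :] - F[None, :, :], axis=-1)  # pairwise distances
    scores = d.sum(axis=1)
    return np.argsort(scores)[:n_prune]
```

For example, with two near-duplicate filters and one outlier, one of the duplicates is selected for pruning while the distinctive outlier survives.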
[CVPR2020] GhostNet: More Features from Cheap Operations
Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.
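Low-precision quantization, which several of these libraries implement, boils down to mapping floats onto a small integer grid. A minimal uniform affine (asymmetric) post-training quantizer, written as an illustrative NumPy sketch rather than any particular library's API, might be:

```python
import numpy as np

def quantize_affine(x, num_bits=8):
    # Uniform affine quantization: x ~= scale * (q - zero_point),
    # with the representable range forced to include zero.
    x = np.asarray(x, dtype=float)
    qmin, qmax = 0, 2 ** num_bits - 1
    lo, hi = min(x.min(), 0.0), max(x.max(), 0.0)
    scale = (hi - lo) / (qmax - qmin) or 1.0  # avoid zero scale for constant input
    zero_point = int(round(qmin - lo / scale))
    q = np.clip(np.round(x / scale) + zero_point, qmin, qmax).astype(np.uint8)
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    # Recover an approximation of the original floats.
    return scale * (q.astype(float) - zero_point)
```

Round-tripping through 8 bits keeps the reconstruction error within about one quantization step; mixed-precision schemes assign different bit widths per layer based on sensitivity.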