Quantization

I have been reading and doing NN model optimization on weight quantization by clusering and weight pruing. I have implemented some basic weight quantization. I will be adding more to this repo.

I have been using libraries like TensorFlow for quantization but I would like to know under the hood of quantization.

I might have made some mistakes. If so, please comment and share so that I can fix it and nonethless we will both get to know something amazing about ongoing research in quantization.

Fixed point weight quantization

float16, float32, float64 -> int16, int8, int4

Clustering based weight quantization

float16, float32, float64 -> int8 [weight indexes] and float32/float16 [cluster centers]

Other weight quantization

Min max weight quantization
Threshold/Probability based weight quantization

Activation quantization

Coming soon

Hoffman encoding based quantization

Coming soon

Weight pruning

Coming soon

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.gitignore		.gitignore
README.md		README.md
fixed_point_quantization.py		fixed_point_quantization.py
kl_divergence_quantization.py		kl_divergence_quantization.py
loss.py		loss.py
other_weight_quantization.py		other_weight_quantization.py
resnet50_quantized.py		resnet50_quantized.py
weight_quantization_clustering.py		weight_quantization_clustering.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Quantization

Fixed point weight quantization

Clustering based weight quantization

Other weight quantization

Activation quantization

Hoffman encoding based quantization

Weight pruning

LSTM weight quantization

About

Uh oh!

Releases

Packages

Languages

anilknayak/quantization

Folders and files

Latest commit

History

Repository files navigation

Quantization

Fixed point weight quantization

Clustering based weight quantization

Other weight quantization

Activation quantization

Hoffman encoding based quantization

Weight pruning

LSTM weight quantization

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages