This demo shows how to build a TensorRT INT8 engine for the cifar10 classification task. It also demonstrates how the calibration dataset size influences the final accuracy after quantization.
The basic code is derived from one of the TensorRT python samples: int8_caffe_mnist. To demonstrate how the calibration dataset size influences the accuracy after int8 quantization, the mnist dataset was replaced with the cifar10 dataset, and LeNet was replaced with ResNet18. The ResNet18 onnx model comes from the pytorch-onnx-tensorrt-CIFAR10 repo.
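The heart of INT8 engine building is the calibrator that feeds batches of sample data to TensorRT. A minimal sketch of that standard pattern is shown below; the class and variable names are illustrative, not copied from sample.py:

```python
import os

import numpy as np
import pycuda.autoinit  # noqa: F401  (creates a CUDA context)
import pycuda.driver as cuda
import tensorrt as trt


class Cifar10EntropyCalibrator(trt.IInt8EntropyCalibrator2):
    """Feeds preprocessed CIFAR-10 batches to TensorRT during INT8 calibration."""

    def __init__(self, calib_data, batch_size, cache_file="calibration.cache"):
        trt.IInt8EntropyCalibrator2.__init__(self)
        self.data = calib_data  # float32 array of shape (N, 3, 32, 32)
        self.batch_size = batch_size
        self.cache_file = cache_file
        self.index = 0
        # Device buffer large enough for one calibration batch.
        self.device_input = cuda.mem_alloc(self.data[0].nbytes * batch_size)

    def get_batch_size(self):
        return self.batch_size

    def get_batch(self, names):
        if self.index + self.batch_size > len(self.data):
            return None  # no batches left -> calibration finishes
        batch = np.ascontiguousarray(self.data[self.index:self.index + self.batch_size])
        cuda.memcpy_htod(self.device_input, batch)
        self.index += self.batch_size
        return [int(self.device_input)]

    def read_calibration_cache(self):
        # Reusing a cached calibration table skips recalibration on later builds.
        if os.path.exists(self.cache_file):
            with open(self.cache_file, "rb") as f:
                return f.read()

    def write_calibration_cache(self, cache):
        with open(self.cache_file, "wb") as f:
            f.write(cache)
```

The calibrator is attached to the builder config together with the INT8 flag, e.g. `config.set_flag(trt.BuilderFlag.INT8)` and `config.int8_calibrator = Cifar10EntropyCalibrator(...)`.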
0.1 Start an NGC TensorRT container: `nvcr.io/nvidia/tensorrt:21.03-py3`
0.2 `git clone https://github.com/shiyongming/TensorRT_quantization_demo_cifar10.git`
0.3 `cd TensorRT_quantization_demo_cifar10`
0.4 `pip install -r requirements.txt`
- You need to change `ONNX_PATH` (line 136 in sample.py) to your own path of `resnet18.onnx`.
- You need to change `cifar10_data_path` (line 137 in sample.py) to your own path of the cifar10 test data `test_batch`.
- You need to change `calib_data_path` (line 138 in sample.py) to your own path of the calibration data.
- `total_images` and `batch_size` (line 144 in sample.py) are the total number of images used for calibration and the batch size for loading the calibration data. They should also be changed; see the configuration sketch after this list.
- If you want to use the whole test dataset for calibration, you can use `convert_to_images.py` to convert the cifar10 `test_batch` file into jpeg images (a sketch of the conversion follows this list). Note that you need to change the path to your own path.
- Run `python sample.py`.
- Select different calibration folders in `cifar10_dataset` to see the influence of the calibration dataset size and calibration batch size.
- Change the condition in lines 64~68 of sample.py to fall some layers back to higher precision, as sketched after this list.
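For orientation, the block the bullets above point at looks roughly like this; the variable names are those referenced above, while the values are placeholders you must replace with your own:

```python
# Around lines 136-144 of sample.py (values below are placeholders):
ONNX_PATH = "/workspace/resnet18.onnx"                    # your resnet18.onnx
cifar10_data_path = "/workspace/cifar10/test_batch"       # cifar10 test data
calib_data_path = "/workspace/cifar10_dataset/calib_10/"  # calibration images
total_images = 10   # number of images used for calibration
batch_size = 10     # batch size for loading the calibration data
```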
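`convert_to_images.py` ships with the repo; as a rough illustration of what such a conversion involves (assuming the standard CIFAR-10 python pickle layout; the function and path names here are mine):

```python
import os
import pickle

from PIL import Image


def convert_to_images(batch_file="test_batch", out_dir="cifar10_dataset/test_images"):
    """Unpack a CIFAR-10 pickle batch into individual jpeg files."""
    os.makedirs(out_dir, exist_ok=True)
    with open(batch_file, "rb") as f:
        batch = pickle.load(f, encoding="bytes")
    data = batch[b"data"]    # uint8 array, shape (10000, 3072)
    labels = batch[b"labels"]
    for i, (row, label) in enumerate(zip(data, labels)):
        # Each row stores three 32x32 channel planes (R, G, B) back to back.
        img = row.reshape(3, 32, 32).transpose(1, 2, 0)
        Image.fromarray(img).save(os.path.join(out_dir, "%d_%05d.jpg" % (label, i)))
```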
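The exact condition at lines 64~68 lives in sample.py itself; the TensorRT mechanism it relies on is per-layer precision control, which in outline (a hypothetical helper, not the repo's code) looks like:

```python
import tensorrt as trt


def fallback_layers(network, config, keywords=("fc", "softmax")):
    """Force layers whose names match `keywords` out of INT8 into FP16."""
    # Without STRICT_TYPES, TensorRT may ignore the per-layer precision hints.
    config.set_flag(trt.BuilderFlag.STRICT_TYPES)
    for i in range(network.num_layers):
        layer = network.get_layer(i)
        if any(k in layer.name.lower() for k in keywords):
            layer.precision = trt.float16
            layer.set_output_type(0, trt.float16)
```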
Before quantization, the original top-1 accuracy on test_batch is 87.81%. After quantization, the top-1 accuracy is shown below.
'10+10' images means that another 10 images were added to the existing '10' images, and '10+10+10' means another 10 images were added to the existing '10+10' images.
'20' and '30' images mean the calibration image sets were selected randomly.
To evaluate the quality of the calibration set, I adopt my other repo calib-dataset-eval to calculate and analyze the distribution of the calibration dataset.
(Figures: calibration-set distributions for 10, 20, 30, and 40 calibration images.)
It can be seen that, from 10 to 40 calibration images, the distribution covers more and more of the area; as a result, the accuracy increases. It can also be seen that some areas are still not covered. This tells us that, if we want to improve the quantization performance, we need to add more images to cover the uncovered areas.
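calib-dataset-eval has its own implementation; in the same spirit, one simple way to quantify how much of the test-set distribution a calibration subset covers is shown below (the function is my own sketch, not the repo's code):

```python
import numpy as np


def coverage_ratio(test_imgs, calib_imgs, bins=16):
    """Fraction of test-occupied histogram cells also hit by calibration images.

    Both inputs are float32 arrays of shape (N, 3, 32, 32), scaled to [0, 1].
    """
    def occupied(imgs):
        feats = imgs.mean(axis=(2, 3))  # project each image to per-channel means
        hist, _ = np.histogramdd(feats, bins=bins, range=[(0.0, 1.0)] * 3)
        return hist > 0

    test_cells = occupied(test_imgs)
    calib_cells = occupied(calib_imgs)
    return (test_cells & calib_cells).sum() / test_cells.sum()
```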