Optimizing TensorFlow models with 8-bit quantization using the Neural Network Compression Framework (NNCF) of OpenVINO™.
This tutorial demonstrates how to use NNCF 8-bit quantization to optimize a TensorFlow model for inference with the OpenVINO™ Toolkit. For more advanced usage, refer to these examples.
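As a quick orientation, below is a minimal sketch of the NNCF call at the heart of this workflow, assuming a Keras model and a `tf.data` dataset. The MobileNetV2 placeholder and the random calibration data are illustrative stand-ins, not the model and data used later in the tutorial.

```python
import numpy as np
import tensorflow as tf

from nncf import NNCFConfig
from nncf.tensorflow import create_compressed_model, register_default_init_args

# Placeholder Keras model; the tutorial fine-tunes ResNet-18 on Imagenette.
model = tf.keras.applications.MobileNetV2(weights=None, classes=10)

# Minimal NNCF configuration: 8-bit quantization of weights and activations.
nncf_config = NNCFConfig.from_dict({
    "input_info": {"sample_size": [1, 224, 224, 3]},  # NHWC input shape
    "compression": {"algorithm": "quantization"},
})

# NNCF needs sample data to initialize quantization ranges; a small random
# dataset stands in here for the real training set.
calibration = tf.data.Dataset.from_tensor_slices(
    (np.random.rand(32, 224, 224, 3).astype("float32"),
     np.random.randint(0, 10, size=32))
).batch(8)
nncf_config = register_default_init_args(nncf_config, calibration, batch_size=8)

# Insert fake-quantization operations into the graph; returns a compression
# controller and a new Keras model that is then fine-tuned as usual.
compression_ctrl, quantized_model = create_compressed_model(model, nncf_config)
```

After this call, `quantized_model` is compiled and trained with the standard `fit()` loop, which is what restores the accuracy lost to quantization.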
To speed up download and training, this tutorial uses a ResNet-18 model with the Imagenette dataset. Imagenette is a subset of 10 easily classified classes from the ImageNet dataset.
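For reference, Imagenette is available through TensorFlow Datasets. A hedged sketch of loading and preprocessing it follows; the `imagenette/160px-v2` config name reflects the TFDS catalog at the time of writing and may differ between TFDS versions, and the 64-pixel input size is an illustrative choice.

```python
import tensorflow as tf
import tensorflow_datasets as tfds

# Load the 160px variant of Imagenette as (image, label) pairs.
train_ds, validation_ds = tfds.load(
    "imagenette/160px-v2",
    split=["train", "validation"],
    shuffle_files=True,
    as_supervised=True,
)

IMG_SIZE = 64  # a small input resolution keeps fine-tuning fast

def preprocess(image, label):
    image = tf.image.resize(image, (IMG_SIZE, IMG_SIZE))
    return tf.cast(image, tf.float32) / 255.0, label

train_ds = train_ds.map(preprocess).batch(128).prefetch(tf.data.AUTOTUNE)
validation_ds = validation_ds.map(preprocess).batch(128).prefetch(tf.data.AUTOTUNE)
```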
This tutorial consists of the following steps:
- Fine-tuning of the `FP32` model.
- Transforming the original `FP32` model to `INT8`.
- Using fine-tuning to restore the accuracy.
- Exporting the optimized and original models to Frozen Graph and then to OpenVINO IR (a conversion sketch follows this list).
- Measuring and comparing the performance of the models.
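To make the export step concrete, here is a sketch using the `openvino` Python API (available in OpenVINO 2023.1 and later), which converts a Keras model to IR directly; older releases, and the frozen-graph path named above, go through the Model Optimizer (`mo`) tool instead. The MobileNetV2 placeholder is an assumption for illustration.

```python
import tensorflow as tf
import openvino as ov

# Placeholder standing in for the fine-tuned FP32 or INT8 Keras model.
model = tf.keras.applications.MobileNetV2(weights=None, classes=10)

# Convert the in-memory Keras model to an OpenVINO model and serialize it
# to IR files (model.xml + model.bin).
ov_model = ov.convert_model(model)
ov.save_model(ov_model, "model.xml")

# Load the IR back and compile it for CPU inference.
core = ov.Core()
compiled_model = core.compile_model("model.xml", "CPU")
```

Performance is then typically measured with OpenVINO's bundled `benchmark_app` tool, for example `benchmark_app -m model.xml -d CPU`, run once for the original model and once for the quantized one.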
If you have not installed all required dependencies, follow the Installation Guide.