QAT recommendation #2451

Closed

shuyuan-wang opened this issue Nov 3, 2022 · 5 comments

Labels: question (Further information is requested), triaged (Issue has been triaged by maintainers)

shuyuan-wang commented Nov 3, 2022

The docs recommend not changing the quantization representation (scale) during training, or at least not changing it too frequently. How exactly do I keep the scale fixed during QAT training?

zerollzeng (Collaborator)

@ttyio ^ ^ I would also like to learn about this.

@zerollzeng added the triaged (Issue has been triaged by maintainers) label Nov 3, 2022
ttyio (Collaborator) commented Nov 4, 2022

@shuyuan-wang, see https://github.com/NVIDIA/TensorRT/blob/main/tools/pytorch-quantization/examples/torchvision/classification_flow.py#L387: the scale is updated during calibration.

# Put every TensorQuantizer into calibration mode: fake quantization is
# disabled and statistics for computing amax/scale are collected instead.
from pytorch_quantization import nn as quant_nn

for name, module in model.named_modules():
    if isinstance(module, quant_nn.TensorQuantizer):
        if module._calibrator is not None:
            module.disable_quant()  # don't fake-quantize while calibrating
            module.enable_calib()   # collect statistics used to compute the scale
        else:
            module.disable()        # no calibrator attached, skip this quantizer

By calling

   module.disable_calib()

calibration is turned off, and you can then call

   module.enable_quant()

to fine-tune without changing the scale.
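Putting the pieces together, here is a minimal sketch of that post-calibration step, following the pattern in the linked classification_flow.py (the helper name freeze_scales_for_finetuning is just for illustration; load_calib_amax converts the collected statistics into a fixed amax/scale, and histogram-based calibrators may need extra arguments such as a calibration method):

from pytorch_quantization import nn as quant_nn

def freeze_scales_for_finetuning(model):
    # After the calibration batches have run, fix the scales and re-enable
    # fake quantization so fine-tuning no longer changes the representation.
    for name, module in model.named_modules():
        if isinstance(module, quant_nn.TensorQuantizer):
            if module._calibrator is not None:
                module.load_calib_amax()  # compute amax/scale from collected stats
                module.disable_calib()    # stop collecting statistics
                module.enable_quant()     # fake-quantize with the now-fixed scale
            else:
                module.enable()

From this point on, training updates only the weights; the scales stay fixed unless calibration is run again.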

@ttyio added the question (Further information is requested) and Topic: QAT labels Nov 4, 2022
shuyuan-wang (Author)

Thanks

zsh4614 commented Jan 6, 2025

(quoting @ttyio's reply above)

Sorry, I have a question. During QAT fine-tuning the model weights change. If the weight scale stays fixed, is the scale computed during calibration still reasonable? Should the scale never be updated during fine-tuning? If it can be updated, how should it be updated during training, and are there any recommended update strategies?

shuyuan-wang (Author) commented Jan 6, 2025

(quoting @zsh4614's question above)

I would say that during QAT, the weights are adjusted based on the scale that was newly calculated during PTQ.
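To make that concrete, here is a rough sketch (not the library's internal code) of fake quantization with a frozen per-tensor scale; during QAT the forward pass applies this quantize-dequantize step with the calibrated scale held fixed, and the gradients (passed through with a straight-through estimator) push the weights toward values that round well on that fixed grid.

import torch

def fake_quantize(w, scale, num_bits=8):
    # Quantize-dequantize with a fixed scale: the rounding error introduced
    # here is what the weights learn to compensate for during fine-tuning.
    qmax = 2 ** (num_bits - 1) - 1
    q = torch.clamp(torch.round(w / scale), -qmax - 1, qmax)
    return q * scale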
