add new format of quantization #41041

yghstill · 2022-03-28T11:44:22Z

PR types

New features

PR changes

APIs

Describe

Add new format of quantization.

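Additional notes:

Why a new quantization format: 1) The current quantization format has one implementation for dynamic graphs and another for static graphs, which is costly to maintain. Models deployed on Intel also need an extra conversion step, so the workflow is long. 2) The fake quantization nodes are numerous and inconsistent (for example fake_quantize_abs_max and fake_quantize_range_abs_max), which makes them hard for users to understand. The inference library has to parse several fake op types, and quantization adds many attributes to the model's native op attrs, which makes adaptation complicated. 3) The quantized model format is therefore upgraded to make quantized models more uniform, to align with the ONNX quantized model standard, and to make model quantization in Paddle easier to use and extend.
Relationship between the new and old formats: the two do not affect each other. Until the new format has been rolled out to all models, quantization still defaults to the old format. Model quantization will migrate to the new format gradually.
How to use the new format: 1) In the static-graph quantization API, set onnx_format=True in PostTrainingQuantization. 2) In the dynamic-graph quantization API, set onnx_format=True in the save_quantized_model interface.
Compatibility: this PR adds a first version of the new quantization format and does not affect the current quantization workflow. The new format will progressively improve the quantization process and the inference deployment flow across platforms and hardware, and will gradually replace the old format. Once the design is mature, quantization will default to the new format. The old format will be kept, and the old APIs and inference deployment flow will remain usable.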
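To illustrate what aligning with the ONNX quantized model standard means in practice, the following is a minimal NumPy sketch of the ONNX-style QuantizeLinear / DequantizeLinear pair (divide by scale, round half to even, add zero point, saturate to int8). It is an assumption-laden illustration, not the kernel in quantize_linear_op: the function names, the per-tensor scale, and the int8 range used here are hypothetical choices for the example.

```python
import numpy as np


def quantize_linear(x, scale, zero_point=0, qmin=-128, qmax=127):
    """ONNX-style quantization sketch (hypothetical helper, not Paddle's op).

    q = saturate(round_half_even(x / scale) + zero_point)
    """
    x = np.asarray(x, dtype=np.float32)
    # np.rint rounds half to even, matching the ONNX QuantizeLinear definition.
    q = np.rint(x / scale) + zero_point
    return np.clip(q, qmin, qmax).astype(np.int8)


def dequantize_linear(q, scale, zero_point=0):
    """ONNX-style dequantization sketch: x_hat = (q - zero_point) * scale."""
    q = np.asarray(q)
    return (q.astype(np.float32) - zero_point) * scale


if __name__ == "__main__":
    x = np.array([-100.0, -0.75, 0.0, 0.25, 0.75, 100.0], dtype=np.float32)
    q = quantize_linear(x, scale=0.5)
    print(q)                                # int8 codes, saturated at the range ends
    print(dequantize_linear(q, scale=0.5))  # float32 reconstruction
```

With a single explicit quantize/dequantize pair like this, an inference backend only has to recognize two op types instead of several fake_quantize_* variants, which is the simplification the description argues for.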

paddle/fluid/operators/quantize_linear_op.cu

paddle/fluid/operators/quantize_linear_op.h

paddle/fluid/operators/quantize_linear_op.cc

python/paddle/fluid/contrib/slim/quantization/imperative/qat.py

python/paddle/fluid/contrib/slim/quantization/quantization_pass.py

jzhang533

LGTM

XiaoguangHu01

LGTM

add new format of quantization

3c4601f

qingqing01 requested review from wanghaoshuang, juncaipeng, qingqing01 and ceci3 March 29, 2022 07:46

add unittest

c4d378a

yghstill force-pushed the add_new_quant_format branch from cfdfc69 to c4d378a Compare March 29, 2022 09:57

fix unittest

b81f3e2

wanghaoshuang reviewed Mar 30, 2022

ceci3 reviewed Mar 30, 2022

wanghaoshuang reviewed Mar 30, 2022

yghstill added 2 commits April 1, 2022 07:03

fix some interface

641d549

fix cuda kernel

80910a4

wanghaoshuang previously approved these changes Apr 1, 2022

ceci3 previously approved these changes Apr 1, 2022

yghstill added 2 commits April 2, 2022 01:46

Merge branch 'develop' of github.com:PaddlePaddle/Paddle into add_new_quant_format

f79f4f9

fix unittest

da5ddf7

yghstill dismissed stale reviews from ceci3 and wanghaoshuang via da5ddf7 April 2, 2022 01:59

ceci3 previously approved these changes Apr 2, 2022

fix coverage

82f2b71

yghstill dismissed ceci3’s stale review via 82f2b71 April 2, 2022 13:03

yghstill added 5 commits April 2, 2022 13:05

Merge branch 'develop' of github.com:PaddlePaddle/Paddle into add_new_quant_format

56e6536

fix clip include

de75349

fix test_quantize_linear_op

7bd3abb

fix unittest

94e7d0e

fix coverage

fd29c72

fix coverage

536838d

Aurelius84 previously approved these changes Apr 4, 2022

fix CI-iScan-Python

7c2bec3

yghstill dismissed Aurelius84’s stale review via 7c2bec3 April 4, 2022 01:38

Aurelius84 approved these changes Apr 4, 2022

Aurelius84 previously approved these changes Apr 4, 2022

TCChenlong reviewed Apr 4, 2022

ceci3 previously approved these changes Apr 4, 2022

yghstill added 2 commits April 4, 2022 04:01

add code comments

324e04c

Merge branch 'develop' of github.com:PaddlePaddle/Paddle into add_new_quant_format

a3aeed9

yghstill dismissed stale reviews from ceci3 and Aurelius84 via a3aeed9 April 4, 2022 04:02

TCChenlong approved these changes Apr 4, 2022

jzhang533 approved these changes Apr 4, 2022

XiaoguangHu01 approved these changes Apr 5, 2022

yghstill merged commit b72a7eb into PaddlePaddle:develop Apr 5, 2022

yghstill deleted the add_new_quant_format branch April 5, 2022 08:57
