[Paddle Inference] Add float_to_half_pass to support inference with mixed precision #47993

yuanlehome · 2022-11-15T05:39:25Z

PR types

Others

PR changes

Others

Describe

Documentation/API update PRs: PR1, PR2, PR3, PR4

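Work in this PR:

Add float_to_half_pass, which runs after all GPU IR passes.
Extend the existing enable_use_gpu interface so that the GPU inference precision can be specified.
Add a GPU half-precision unit test.
Rename the C++ interface Exp_SetBlackListOpsForMixedModel to Exp_DisableMixedPrecisionOps.
Remove unneeded header files from a unit test file.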

Usage:
Python ---> config.enable_use_gpu(512, 0, PrecisionType.Half)
C++ ---> config.EnableUseGpu(512, 0, PrecisionType::kHalf);
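
To illustrate the general idea behind a float-to-half pass combined with a list of ops excluded from mixed precision, here is a minimal, self-contained Python sketch. It does not use Paddle and is not the implementation in float_to_half_pass.cc. The Op class, the float_to_half_sketch function, the disabled_ops parameter, and the ".cast_<dtype>" naming scheme are all hypothetical names chosen for this example. It assumes a linear list of ops whose graph inputs start as float32.

```python
from dataclasses import dataclass


@dataclass
class Op:
    """Hypothetical stand-in for a graph node; not a Paddle type."""
    name: str      # op type, e.g. "conv2d"
    inputs: list   # names of input variables
    outputs: list  # names of output variables
    dtype: str = "float32"


def float_to_half_sketch(ops, disabled_ops=frozenset(), half_dtype="float16"):
    """Toy pass: run ops in half precision unless their type is disabled,
    inserting cast ops wherever the precision changes between ops."""
    produced_as = {}  # variable name -> dtype it was produced in
    casts = {}        # (variable, target dtype) -> name of the cast output
    result = []
    for op in ops:
        target = "float32" if op.name in disabled_ops else half_dtype
        new_inputs = []
        for var in op.inputs:
            # Graph inputs that no op produced are assumed to be float32.
            if produced_as.get(var, "float32") == target:
                new_inputs.append(var)
                continue
            key = (var, target)
            if key not in casts:  # reuse an existing cast of the same variable
                casts[key] = f"{var}.cast_{target}"
                result.append(Op("cast", [var], [casts[key]], target))
            new_inputs.append(casts[key])
        result.append(Op(op.name, new_inputs, list(op.outputs), target))
        for out in op.outputs:
            produced_as[out] = target
    return result


# Usage: keep softmax in float32, run everything else in float16.
graph = [
    Op("conv2d", ["x"], ["a"]),
    Op("softmax", ["a"], ["b"]),
    Op("matmul", ["b"], ["y"]),
]
for op in float_to_half_sketch(graph, disabled_ops={"softmax"}):
    print(op.name, op.dtype, op.inputs, "->", op.outputs)
```

In this sketch the disabled set plays a role loosely analogous to what Exp_DisableMixedPrecisionOps configures: listed op types stay in float32 and casts are placed at the boundaries.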

TODO:

Make the pass compatible with Paddle-TRT FP16 inference
Switch the underlying implementation of the convert_to_mixed_precision interface

paddle-bot · 2022-11-15T05:39:29Z

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

paddle/fluid/inference/api/paddle_analysis_config.h

paddle/fluid/framework/ir/float_to_half_pass.h

paddle/fluid/inference/api/analysis_config.cc

paddle/fluid/framework/ir/float_to_half_pass.cc

XieYunshen

LGTM for
set_tests_properties(gpu_ernie_half_test PROPERTIES TIMEOUT 40)

jiweibo

LGTM

XiaoguangHu01

LGTM

From00

LGTM for mutable_data in float_to_half_pass

…ixed precision (PaddlePaddle#47993)

* [Release2.4] Revert python link prs (#48573)
* Revert "Fix mac link python (#48017)" This reverts commit 3fa7a73.
* Revert "[Cherry-pick] Fix python link error (#47811)" This reverts commit ff642c6.
* Update config.go
* [Paddle Inference] Add float_to_half_pass to support inference with mixed precision (#47993)
* [Inference] optimize some code and fix some bug (#48780)
* clean ir_pass_manager and fix map_depthwise_conv_to_conv_pass
* fix unitest timeout
* [Paddle Inference] clean unused code (#48392)
* fix
* update
* update

Co-authored-by: Chen Weihang <chenweihang@baidu.com>

yuanlehome added 12 commits November 9, 2022 08:36

update

67fe3da

update

3a1b885

update

ba06455

update

d7d8b61

update

06435b9

update

60c8f84

update

17a34ab

update

83df3cf

new stage

c7d24e9

update

691b5d9

update

89a1696

update

0fac471

yuanlehome changed the title ~~[WIP][Paddle Inference] Add float_to_mixed_pass to support mixed precision inference~~ [WIP][Paddle Inference] Add float_to_mixed_pass to support inference with mixed precision Nov 15, 2022

yuanlehome added 5 commits November 15, 2022 09:24

select_input op

8f49d68

only support dense tensor

ab22b48

op kernel only support cpu

c2d7768

update

23fe35f

conv2d_transpose

60d4d72

yuanlehome changed the title ~~[WIP][Paddle Inference] Add float_to_mixed_pass to support inference with mixed precision~~ [Paddle Inference] Add float_to_mixed_pass to support inference with mixed precision Nov 18, 2022

yuanlehome requested review from qingqing01 and jiweibo November 18, 2022 08:48

yuanlehome force-pushed the add_float_to_mixed_pass branch from a116b94 to 60d4d72 November 18, 2022 17:23

yuanlehome added 2 commits November 18, 2022 17:26

add unit test

eacd2db

fix unitest

686c441

yuanlehome force-pushed the add_float_to_mixed_pass branch from b593f0c to 686c441 November 21, 2022 05:29

adjust unitest timeout

d0db320

yuanlehome force-pushed the add_float_to_mixed_pass branch from c286881 to d0db320 November 21, 2022 11:19

yuanlehome added 2 commits November 23, 2022 08:37

merge

db75218

support bf16

3e35263

yuanlehome added 2 commits December 1, 2022 09:43

merge

33d5784

change mixed to half

8478b61

yuanlehome changed the title ~~[Paddle Inference] Add float_to_mixed_pass to support inference with mixed precision~~ [Paddle Inference] Add float_to_half_pass to support inference with mixed precision Dec 1, 2022

yuanlehome added 3 commits December 1, 2022 10:13

-

5c58143

-

e0873da

fix unitest

fbbc60a

zhangjun reviewed Dec 2, 2022

paddle/fluid/inference/api/paddle_analysis_config.h