-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Paddle Inference] Add float_to_half_pass to support inference with mixed precision #47993
[Paddle Inference] Add float_to_half_pass to support inference with mixed precision #47993
Conversation
你的PR提交成功,感谢你对开源项目的贡献! |
a116b94
to
60d4d72
Compare
b593f0c
to
686c441
Compare
c286881
to
d0db320
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM for
set_tests_properties(gpu_ernie_half_test PROPERTIES TIMEOUT 40)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM for mutable_data
in float_to_half_pass
* [Release2.4] Revert python link prs (#48573) * Revert "Fix mac link python (#48017)" This reverts commit 3fa7a73. * Revert "[Cherry-pick] Fix python link error (#47811)" This reverts commit ff642c6. * Update config.go * [Paddle Inference] Add float_to_half_pass to support inference with mixed precision (#47993) * [Inference] optimize some code and fix some bug (#48780) * clean ir_pass_manager and fix map_depthwise_conv_to_conv_pass * fix unitest timeout * [Paddle Inference] clean unused code (#48392) * fix * update * update Co-authored-by: Chen Weihang <chenweihang@baidu.com>
PR types
Others
PR changes
Others
Describe
文档API更新PR1、PR2、PR3、PR4
PR工作:
使用方式:
python ---> config.enable_use_gpu(512, 0, PrecisionType::kHalf);
c++ ---> config.EnableUseGpu(512, 0, PrecisionType::kHalf);
TODO: