Transfer MultiHeadAttention's matmul to v2 op #36222

FrostML · 2021-09-29T10:43:37Z · edited by raindrops2sea

PR types

Others

PR changes

APIs

Describe

Switch the matmul call in MultiHeadAttention to the v2 op:

matmul -> matmul_v2.
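The diff itself is not shown in this conversation, so the following is only a minimal sketch of what such a migration typically has to preserve. It assumes the legacy matmul call folded the attention scaling factor into the op (an `alpha`-style argument) and that the v2 op takes no such argument, so the scale is applied to the query explicitly. The helper names `matmul_nt`, `scores_legacy_style` and `scores_v2_style` are hypothetical, and plain Python lists stand in for tensors so the snippet runs without Paddle installed.

```python
def matmul_nt(a, b):
    """Return a @ b^T for 2-D lists (each row of b is dotted with each row of a)."""
    return [[sum(x * y for x, y in zip(row_a, row_b)) for row_b in b] for row_a in a]


def scores_legacy_style(q, k, head_dim):
    # Assumption: the legacy op scaled the product itself, i.e. alpha * (q @ k^T).
    alpha = head_dim ** -0.5
    return [[alpha * v for v in row] for row in matmul_nt(q, k)]


def scores_v2_style(q, k, head_dim):
    # Assumption: the v2 op has no scaling argument, so q is scaled first
    # and then multiplied: (scale * q) @ k^T.
    scale = head_dim ** -0.5
    q_scaled = [[scale * v for v in row] for row in q]
    return matmul_nt(q_scaled, k)


def max_abs_diff(a, b):
    """Largest element-wise absolute difference between two 2-D lists."""
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


# Toy inputs: 2 query positions, 3 key positions, head_dim = 4.
q = [[1.0, 2.0, 3.0, 4.0], [0.5, -1.0, 2.0, 0.0]]
k = [[0.0, 1.0, 0.0, 1.0], [2.0, 2.0, -1.0, 0.5], [1.0, 0.0, 0.0, 0.0]]
head_dim = 4

legacy = scores_legacy_style(q, k, head_dim)
v2 = scores_v2_style(q, k, head_dim)
```

Both forms are mathematically equivalent, which is why a change like this is expected to leave model outputs unchanged up to floating-point rounding and to show up only as a performance difference.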

See the QA report for detailed performance results.

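Test conclusion:

Neither static-graph nor dynamic-graph mode shows an abnormal performance drop of more than 5%.
Dynamic graph: the largest performance drop in 8-card runs is transformer big bs4096 amp fp16: -3.55%
Static graph: the largest performance drop in 8-card runs is bert large seqlen512 fp32 bs10: -2.82%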

paddle-bot-old · 2021-09-29T10:44:47Z

Thanks for your contribution!
Please wait for the CI results first. See Paddle CI Manual for details.

paddle-bot-old · 2021-10-15T06:24:54Z

Sorry to inform you that the CIs for 3a4c032 passed more than 7 days ago. To prevent PR conflicts, you need to re-run all CIs manually.


jeff41404

lgtm

XiaoguangHu01

LGTM

raindrops2sea · 2021-12-10T10:56:04Z

Please update the description.

promote to v2

3a4c032

FrostML requested a review from jeff41404 September 29, 2021 10:58

FrostML added 3 commits November 23, 2021 05:41

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

78813ce

… scale-part

alter

0610feb

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

802c38c

… scale-part

FrostML requested review from raindrops2sea, XiaoguangHu01 and guoshengCS December 10, 2021 03:18

jeff41404 approved these changes Dec 10, 2021


XiaoguangHu01 approved these changes Dec 10, 2021


guoshengCS approved these changes Dec 10, 2021


raindrops2sea approved these changes Dec 10, 2021


jeff41404 merged commit 6549405 into PaddlePaddle:develop Dec 10, 2021
