[Paddle Inference] Fix mmha when src_mask is not equal to zero #57936

Merged
carryyu merged 5 commits into PaddlePaddle:develop on Oct 11, 2023

Conversation

xiaoxiaohehe001
Contributor

@xiaoxiaohehe001 xiaoxiaohehe001 commented Oct 8, 2023

PR types

Others

PR changes

Others

Description

Fix the mmha (masked multi-head attention) output diff when src_mask is not equal to zero.
Pcard-71502
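
For context, a minimal sketch of the failure mode this PR addresses (illustrative only; apply_src_mask is a hypothetical helper, not the merged kernel code, though the names follow the diffs below):

// Hypothetical helper, not the actual Paddle kernel code.
// qk is the attention logit for key position ti; act_time_step is the
// index of the current decoding step; attn_mask carries the src_mask values.
__device__ __forceinline__ float apply_src_mask(float qk,
                                                const float* attn_mask,
                                                int mask_offset,
                                                int ti,
                                                int act_time_step) {
  // The current position (ti == act_time_step) must also receive the mask
  // when src_mask is nonzero; a `ti < act_time_step` bound skips it, which
  // caused the diff this PR fixes (hence `ti < act_time_step + 1` below).
  if (attn_mask != nullptr && ti < act_time_step + 1) {
    qk += attn_mask[mask_offset + ti];
  }
  return qk;
}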

@paddle-bot

paddle-bot bot commented Oct 8, 2023

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@@ -652,7 +652,7 @@ __global__ void masked_multihead_attention_kernel(
 #pragma unroll
     for (int ii = 0; ii < K_VECS_PER_THREAD; ++ii) {
       int jj = ii * params.max_seq_length + ti;
-      if (ti < act_time_step) {
+      if (ti < act_time_step + 1) {
Contributor

The +1 is not needed here.

@@ -674,7 +674,7 @@ __global__ void masked_multihead_attention_kernel(
       float qk = Qk_dot<T, THREADS_PER_KEY>::dot(q, k, params.inv_sqrt_dh);

       // bool is_mask = false;
-      if (ti < act_time_step && tid % THREADS_PER_KEY == 0) {
+      if (ti < act_time_step + 1 && tid % THREADS_PER_KEY == 0) {
Contributor

Could the logic related to this +1 be moved to line 613 behind an added check? In most cases, no mask is added at the current position.
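
A sketch of what this suggestion might look like (an assumption, not the merged code; the indexing mirrors the commented-out lines in the @@ -609 hunk below, and params.attn_mask is an assumed field name):

// Assumed refactor per the review: keep the history loop bounded by
// act_time_step and add the current position's mask once, near line 613,
// so the common unmasked path pays no extra cost.
qk *= params.inv_sqrt_dh;
if (params.attn_mask) {
  const int mask_bhi = params.mask_broadcast_num_heads ? bi : bhi;
  // mask entry for the current step; layout mirrors the commented-out code
  auto mask = params.attn_mask[mask_bhi * (params.timestep + 1) +
                               params.timestep];
  qk += static_cast<float>(mask);
}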

@xiaoxiaohehe001 xiaoxiaohehe001 changed the title [Paddle Inference] Fix mmha when src_mask is not equal to zero. [Paddle Inference] Fix mmha when src_mask is not equal to zero Oct 10, 2023
@carryyu carryyu self-requested a review October 10, 2023 08:00
carryyu previously approved these changes Oct 10, 2023
@@ -609,6 +609,11 @@ __global__ void masked_multihead_attention_kernel(
       // bi * (params.timestep + 1) + params.timestep];
       // qk += static_cast<float>(mask);
       qk *= params.inv_sqrt_dh;
+      auto mask_bhi = params.mask_broadcast_num_heads ? bi : bhi;
Contributor

This part can be moved inside the if branch, so that mask_bhi is only computed when there is an attn mask.
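
A minimal sketch of the requested change (assuming the enclosing branch tests params.attn_mask, as the comment implies; not the verbatim merged code):

if (params.attn_mask) {
  // Compute mask_bhi only on the masked path, per the review comment.
  const auto mask_bhi = params.mask_broadcast_num_heads ? bi : bhi;
  // ... index attn_mask with mask_bhi and add the mask value to qk ...
}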

Contributor Author

Done~

@carryyu carryyu self-requested a review October 10, 2023 08:02
@carryyu carryyu merged commit d78ec37 into PaddlePaddle:develop Oct 11, 2023
27 checks passed
Frida-a pushed a commit to Frida-a/Paddle that referenced this pull request Oct 14, 2023
…ePaddle#57936)

* fix_mmha_scrmask
* fix_mmha_scrmask
* remove_mask_to_qk_smem_act_time_step
* remove_add_!
* mask_bhi
jiahy0825 pushed a commit to jiahy0825/Paddle that referenced this pull request Oct 16, 2023
danleifeng pushed a commit to danleifeng/Paddle that referenced this pull request Nov 14, 2023