[OPT] FlashAttention && ModelParallel #51617
Conversation
Your PR has been submitted successfully. Thank you for your contribution to this open source project!
phi::TensorFromVector<int64_t>(seed_offset_vec, ctx, seed_offset);
paddle::platform::CPUPlace cpu_place;
seed_offset->Resize({2});
auto* seed_offset_data = seed_offset->mutable_data<uint64_t>(cpu_place);
Use auto* seed_offset_data = ctx.template HostAlloc<uint64_t>(seed_offset) instead.
done, thx
@staticmethod
def forward(ctx, x, mp_degree):
    out = paddle.scale(x, 1.0 / mp_degree)
    ctx.mp_degree = mp_degree
Remove this line.
done.
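For illustration, here is a minimal, hypothetical sketch of the scaling PyLayer with the flagged line removed. The class name ScaleByMpDegree and the identity backward are assumptions rather than code taken from this PR; dropping ctx.mp_degree implies the backward no longer rescales the gradient, a common pattern in tensor parallelism where a later allreduce across the mp_degree ranks supplies the matching summation.

import paddle
from paddle.autograd import PyLayer

# Hypothetical sketch, not this PR's actual code: the ctx.mp_degree
# assignment flagged above is dropped, so backward keeps no state.
class ScaleByMpDegree(PyLayer):
    @staticmethod
    def forward(ctx, x, mp_degree):
        # Scale activations by 1 / mp_degree for model parallelism.
        return paddle.scale(x, 1.0 / mp_degree)

    @staticmethod
    def backward(ctx, dout):
        # Assumed identity backward: the gradient passes through
        # unscaled; non-tensor inputs such as mp_degree get no grad.
        return dout

Usage would be y = ScaleByMpDegree.apply(x, mp_degree) inside the model-parallel block.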
PR types
Performance optimization
PR changes
Others
Describe
Optimize GPT performance.