[Semi-Auto] Support parallel cross entropy in static semi-auto training #59187

pkuzyc · 2023-11-21T02:57:44Z

PR types

Performance optimization

PR changes

Others

Description

Pcard-76459
Support parallel cross entropy in static semi-auto training.

Introduction of parallel cross entropy
Parallel cross entropy is a performance optimization strategy when the tensor is sharded on the softmax normalize axis. Parallel cross entropy will first perform some local computation and then do communication. Compared to the original pipline that performs communication before computation, parallel cross entropy reduces the communication elements size from (b,s,v) to (b,s), and also reduces the computation elements number on each process. The following figure shows the pipline difference between parallel cross entropy and the original cross entropy.

What this pr does
This pr supports parallel cross entropy in static semi-auto training. When there is a softmax_with_cross_entropy operator in the model, we will first use spmd rule (#58913) to infer the sharding status of its input and output tensors. If the input tensor is sharded on the softmax normalize axis, we will select c_softmax_with_cross_entropy kernel.

paddle-bot · 2023-11-21T02:57:49Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

paddle-ci-bot · 2023-11-29T03:11:04Z

Sorry to inform you that 96bdc78's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.

JZ-LIANG

LGTM

pkuzyc added 5 commits November 29, 2023 10:36

adapt cross_entropy_with_softmax rule to phi

63b72d1

support parallel cross_entropy in auto parallel

a080b2d

small fix

1f35d07

temporary save

228f1e7

add unit test for parallel_cross_entropy

9f2ba56

pkuzyc added 2 commits November 29, 2023 12:45

resolve conflicts

ab0b31d

small fix

c03fb42

pkuzyc force-pushed the semi_auto/c_cross_entropy branch from 96bdc78 to c03fb42 Compare November 29, 2023 13:56

tianshuo78520a approved these changes Nov 30, 2023

View reviewed changes

JZ-LIANG approved these changes Nov 30, 2023

View reviewed changes

JZ-LIANG merged commit 8fb0f1f into PaddlePaddle:develop Nov 30, 2023
29 checks passed

pkuzyc deleted the semi_auto/c_cross_entropy branch December 22, 2023 09:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Semi-Auto] Support parallel cross entropy in static semi-auto training #59187

[Semi-Auto] Support parallel cross entropy in static semi-auto training #59187

pkuzyc commented Nov 21, 2023 •

edited

Loading

paddle-bot bot commented Nov 21, 2023

paddle-ci-bot bot commented Nov 29, 2023

JZ-LIANG left a comment

[Semi-Auto] Support parallel cross entropy in static semi-auto training #59187

[Semi-Auto] Support parallel cross entropy in static semi-auto training #59187

Conversation

pkuzyc commented Nov 21, 2023 • edited Loading

PR types

PR changes

Description

paddle-bot bot commented Nov 21, 2023

paddle-ci-bot bot commented Nov 29, 2023

JZ-LIANG left a comment

Choose a reason for hiding this comment

pkuzyc commented Nov 21, 2023 •

edited

Loading