Skip to content

feat(moe): add gshard token rearrange optim #1127

feat(moe): add gshard token rearrange optim

feat(moe): add gshard token rearrange optim #1127

Annotations

4 errors and 1 warning

training_16GPU_4DP2TP2PP_MSP (910B)

failed Oct 25, 2024 in 3m 25s