Skip to content

feat(moe): add gshard token rearrange optim #1127

feat(moe): add gshard token rearrange optim

feat(moe): add gshard token rearrange optim #1127

Annotations

2 warnings

training_8GPU_4DP2PP_ZB

succeeded Oct 25, 2024 in 1m 45s