Fix expert grad scaling problem with ZeRO optimizer #6546

Merged
tohtana merged 7 commits into deepspeedai:master from wyooyw:fix_expert_weight_grad_with_zero on Oct 23, 2024

Commits

Commits on Sep 17, 2024

Commits on Sep 18, 2024

Commits on Oct 11, 2024

Commits on Oct 14, 2024