Checkpoint fails in single node multi-GPU mode using DDP #1119
Comments
yeah we shall run all examples in CI too
I believe this happens with multiple gpus as well. And it only seems to happen if
@Borda fixed. part of the code that caused the bug was removed a few commits back.
fix is in master?
fix for DDP checkpoint is in #1125, still waiting for it to be reviewed and merged.
as for this issue, on my side it seems to work fine. can you double check?
🐛 Bug
Checkpoint fails in single node multi-GPU mode using DDP.
To Reproduce