You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
After diffusers 0.17.0, non-reentrant variant of torch.utils.checkpoint.checkpoint becomes default but is incompatible with Deepspeed Stage3.
I found #4118, then replaced torch.utils.checkpoint.checkpoint with deepspeed.checkpointing.non_reentrant_checkpoint before import diffusers but still met an error said "RuntimeError: The size of tensor a (0) must match the size of tensor b (1280) at non-singleton dimension 1".
Maybe I use non_reentrant_checkpoint improperly. Any suggestions for me to make non-reentrant checkpoint compatible with stage3?
See #4332.
After diffusers 0.17.0, non-reentrant variant of
torch.utils.checkpoint.checkpoint
becomes default but is incompatible with Deepspeed Stage3.I found #4118, then replaced
torch.utils.checkpoint.checkpoint
withdeepspeed.checkpointing.non_reentrant_checkpoint
before import diffusers but still met an error said "RuntimeError: The size of tensor a (0) must match the size of tensor b (1280) at non-singleton dimension 1".Maybe I use
non_reentrant_checkpoint
improperly. Any suggestions for me to make non-reentrant checkpoint compatible with stage3?cc @hughpu
The text was updated successfully, but these errors were encountered: