disable direct load to gpu to avoid OOM #317

Merged
akoumpa merged 3 commits into main from akoumparouli/fix_dist_ckpt_format
May 10, 2024

Conversation

akoumpa
Member

@akoumpa akoumpa commented May 10, 2024

Don't load directly onto the GPU, to avoid OOM errors.

See also NVIDIA/NeMo#9125

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
@akoumpa akoumpa changed the title from "disable direct load to gpu" to "disable direct load to gpu to avoid OOM" May 10, 2024
akoumpa and others added 2 commits May 10, 2024 15:33
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Member Author

akoumpa commented May 10, 2024

I'm waiting for a Slurm job to finish before removing the draft status. Thanks.

@akoumpa akoumpa marked this pull request as ready for review May 10, 2024 22:43
@akoumpa akoumpa requested a review from pablo-garay May 10, 2024 22:43
@akoumpa akoumpa merged commit 2cd0a71 into main May 10, 2024
3 checks passed
@akoumpa akoumpa deleted the akoumparouli/fix_dist_ckpt_format branch June 29, 2024 07:46