This repository has been archived by the owner on Oct 31, 2023. It is now read-only.

OOM Error while training Electra QA model #23

Open
todiketan opened this issue Jan 8, 2022 · 0 comments

Comments


todiketan commented Jan 8, 2022

Hello, thanks a lot for sharing the code for the paper. I was trying to train the ELECTRA base model from scratch, but CPU RAM usage increases with every iteration until the process is eventually killed because the CPU RAM is full. GPU memory usage stays constant throughout training. I am using a system with 64 GB of CPU RAM. Could any of the authors (or anyone who has trained or fine-tuned the QA model) share the exact version of PyTorch used for the experiments, and say whether they faced a similar issue while training the model?
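
For anyone trying to reproduce this, the growth in host memory can be logged per iteration with something like the sketch below. It is only a minimal illustration that assumes a Unix system and uses the Python standard library; `peak_rss_mb`, `log_memory`, and the dummy loop are hypothetical names and are not part of this repository's training code.

```python
import resource
import sys


def peak_rss_mb():
    """Return the peak resident set size of this process in MiB (Unix only)."""
    peak = resource.getrusage(resource.RUSAGE_SELF).ru_maxrss
    # ru_maxrss is reported in kilobytes on Linux and in bytes on macOS.
    divisor = 1024 * 1024 if sys.platform == "darwin" else 1024
    return peak / divisor


def log_memory(step, every=100):
    """Print peak host memory every `every` steps; return the reading or None."""
    if step % every != 0:
        return None
    mb = peak_rss_mb()
    print(f"step {step}: peak RSS {mb:.1f} MiB")
    return mb


if __name__ == "__main__":
    # Stand-in for the real training loop (assumption): the growing list
    # imitates host-side state that is never released between iterations.
    leaked = []
    for step in range(50):
        leaked.append(bytearray(1024 * 1024))  # hold on to 1 MiB per step
        log_memory(step, every=10)
```

Calling `log_memory(step)` once per training iteration should show whether the peak keeps climbing; a reading that rises steadily while GPU memory stays flat would match the behaviour described above.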

Thanks in advance.
