fix accelerator prepare during eval only mode #24014
Conversation
Unless I'm missing something, this changes the whole logic of the evaluation in the Trainer and should not be done.
src/transformers/trainer.py (outdated):

    model = self._wrap_model(self.model, training=False, dataloader=dataloader)

    if len(self.accelerator._models) == 0 and model is self.model:
        model = self.accelerator.prepare(model)
No, we only want to do this for DeepSpeed, not all the time. Putting a model in DistributedDataParallel just for evaluation will waste some memory.
I do agree on the DDP case, and hence I didn't update it earlier, but as mentioned below we would be missing mixed-precision coverage for eval-only mode.
The thing is that mixed-precision application for eval-only mode won't work unless we prepare the model.
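The mixed-precision point can be sketched with plain PyTorch autocast, which is roughly what `accelerator.prepare` installs around the model's `forward`; the `Probe` module below is a hypothetical stand-in used only to observe the compute dtype:

```python
import torch

class Probe(torch.nn.Module):
    """Records the dtype its linear layer actually computes in."""
    def __init__(self):
        super().__init__()
        self.lin = torch.nn.Linear(4, 2)
        self.inner_dtype = None

    def forward(self, x):
        y = self.lin(x)
        self.inner_dtype = y.dtype  # dtype under autocast, if any
        return y

probe = Probe().eval()
x = torch.randn(1, 4)

# Unprepared model: no autocast context, so eval runs entirely in fp32.
with torch.no_grad():
    probe(x)
assert probe.inner_dtype == torch.float32

# What prepare() effectively adds for mixed precision: an autocast
# context around forward (bf16 on CPU here, for portability).
with torch.no_grad(), torch.autocast(device_type="cpu", dtype=torch.bfloat16):
    probe(x)
assert probe.inner_dtype == torch.bfloat16
```

Without the autocast wrapper that `prepare` installs, an eval-only run silently computes in full precision even when the user requested fp16/bf16.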
* fix mixed precision prep during eval only mode
* update to address comments
* update to reflect the changes in accelerate
What does this PR do?
Currently, the accelerator `prepare` method is called only during the training loop. If the user directly calls `evaluate`/`predict` without running the training loop first, the model isn't prepared, leading to wrong behaviour. This PR is aimed at fixing it, using the `evaluation_mode` argument from accelerate#1540.