How to continue training from a checkpoint with Trainer? #2656
Unanswered
Superskyyy asked this question in Q&A
Replies: 1 comment
I've been using PPOTrainer/RLOOTrainer, and it seems that `resume_from_checkpoint` doesn't work. Looking at the code, I was surprised to find that no checkpoint-loading mechanism is implemented at all, not even the one from Hugging Face transformers (the trainer's `train()` method doesn't take a `resume_from_checkpoint` argument).
How can I load back the checkpoint and resume training? I assume people have been using this feature and that I somehow missed the guide on how to do so.
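For now, the only workaround I can think of is to reload the saved policy weights myself and rebuild the trainer around them. A rough sketch of what I mean is below; the checkpoint path and the trainer arguments are placeholders, and as far as I can tell this would not restore the optimizer or LR-scheduler state. Is this the intended way, or is there a proper mechanism I'm missing?

```python
# Sketch of a manual "resume": reload the policy weights saved by a previous run
# and hand them to a freshly constructed trainer. Paths and trainer arguments are
# placeholders; optimizer and scheduler state are NOT restored this way.
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint_dir = "output/checkpoint-1000"  # hypothetical path to a saved checkpoint

# Load the tokenizer and the policy model weights from the checkpoint directory.
tokenizer = AutoTokenizer.from_pretrained(checkpoint_dir)
policy = AutoModelForCausalLM.from_pretrained(checkpoint_dir)

# Rebuild the trainer around the reloaded policy and continue training.
# (Constructor arguments depend on the TRL version; shown schematically only.)
# trainer = RLOOTrainer(config=..., policy=policy, ref_policy=..., reward_model=..., ...)
# trainer.train()
```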