Add model-last saving mechanism to pretraining #12459
Conversation
I'll fix and add some unit tests. If we decide to implement this feature, do we want to mention this in the documentation?
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
And I would definitely add this to the docs.
The CI seems to have gotten into a weird state, let me close and reopen this.
Looks good to me.
The one thing I wonder about is whether this could influence existing workflows somehow, like users iterating over all `.bin` files in the output folder, assuming the names all look like `modelX.bin` with `X` an int, and that this could now fail when they encounter `model-last.bin`.
Could we make this opt-in for the upcoming bugfix release? Or at least provide an easy switch to turn this off if users don't want the additional duplicate file?
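To illustrate the concern above: a minimal sketch (hypothetical helper, not from the PR) of a user-side workflow that iterates over pretraining checkpoints and assumes `modelX.bin` names with an integer `X`. A naive `int()` parse would fail on `model-last.bin`, so this version filters to purely numeric suffixes first.

```python
# Hypothetical sketch of a robust checkpoint-listing helper.
# It tolerates a model-last.bin alongside model0.bin, model1.bin, ...
import re
from pathlib import Path


def numbered_checkpoints(output_dir):
    """Return sorted (epoch, path) pairs for modelX.bin files,
    skipping non-numeric names such as model-last.bin."""
    pattern = re.compile(r"model(\d+)\.bin")
    pairs = []
    for path in Path(output_dir).glob("model*.bin"):
        match = pattern.fullmatch(path.name)
        if match:  # only purely numeric suffixes
            pairs.append((int(match.group(1)), path))
    return sorted(pairs)
```

A workflow built like this would keep working whether or not the duplicate `model-last.bin` file is written.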
* Adjust pretrain command
* Change naming and add finally block
* Add unit test
* Add unit test assertions
* Update spacy/training/pretrain.py
* Change finally block
* Add to docs
* Update website/docs/usage/embeddings-transformers.mdx
* Add flag to skip saving model-last

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
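The commit list above mentions a `finally` block and an opt-out flag for skipping `model-last`. A minimal sketch of that idea (simplified stand-in names, not spaCy's actual implementation): save a numbered checkpoint each epoch, then always duplicate the final state under a stable name unless the flag is set.

```python
# Hypothetical sketch: pretrain_loop, model_bytes_per_epoch, and
# skip_last are illustrative names, not spaCy's real API.
from pathlib import Path


def pretrain_loop(model_bytes_per_epoch, output_dir, skip_last=False):
    output_dir = Path(output_dir)
    epoch = -1
    try:
        for epoch, model_bytes in enumerate(model_bytes_per_epoch):
            # Numbered checkpoint per epoch, as before
            (output_dir / f"model{epoch}.bin").write_bytes(model_bytes)
    finally:
        # Even if training is interrupted, duplicate the most recent
        # state under a stable name -- unless the user opted out.
        if not skip_last and epoch >= 0:
            (output_dir / "model-last.bin").write_bytes(model_bytes)
```

The `finally` block means a stable `model-last.bin` exists even after an interrupted run, while `skip_last=True` preserves the old behavior for workflows that expect only numbered files.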
Description
This PR takes inspiration from the training loop to save the last epoch of pretraining as `model-last.bin` instead of `model<last_epoch>.bin`. The PR aims to make it easier for users to work with pretrained weights in an automated workflow (e.g. spaCy projects) and to reduce manual adjustment based on the `max_epochs` set in the training config.

Types of change
Feature
Checklist