Issue1416 #1533
Conversation
Codecov Report
@@           Coverage Diff            @@
##             master    #1533   +/- ##
=========================================
  Coverage          ?   97.11%
=========================================
  Files             ?       55
  Lines             ?     2077
  Branches          ?      341
=========================================
  Hits              ?     2017
  Misses            ?       31
  Partials          ?       29
=========================================
@ptrcklv thanks for taking this on! One of the maintainers will take a look soon!
@ptrcklv — thanks so much for this contribution and great documentation!!
left a few formatting comments / requests for additional docs. please re-request a review once you've made changes!
trainer1.optimizer.state_dict()["state"][k]["exp_avg"],
trainer2.optimizer.state_dict()["state"][k]["exp_avg"],
why are we only checking equivalence of these fields? could we check the entire state_dict's values?
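A minimal sketch of what that fuller comparison could look like; the helper name and unittest wiring are hypothetical, not part of this PR:

    import torch

    # Hypothetical helper: compare every field of both optimizer
    # state_dicts, not just "exp_avg".
    def assert_optimizer_states_equal(test_case, optimizer1, optimizer2):
        state1 = optimizer1.state_dict()["state"]
        state2 = optimizer2.state_dict()["state"]
        test_case.assertEqual(set(state1.keys()), set(state2.keys()))
        for k in state1:
            test_case.assertEqual(set(state1[k].keys()), set(state2[k].keys()))
            for field, value in state1[k].items():
                if isinstance(value, torch.Tensor):
                    test_case.assertTrue(torch.equal(value, state2[k][field]))
                else:
                    test_case.assertEqual(value, state2[k][field])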
@@ -216,6 +216,35 @@ def test_warmup(self):
        trainer.fit(model, [dataloaders[0]])
        self.assertEqual(trainer.warmup_steps, 1)

    def test_save_load(self):
        fd, checkpoint_path = tempfile.mkstemp()
can we put this in a try/except/finally or use a context with tempfile.NamedTemporaryFile() as f: to ensure proper cleanup?
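For illustration, a sketch of the suggested context-manager form; trainer1, trainer2, and model are assumed from the surrounding test:

    import tempfile

    # The temporary file is deleted automatically when the context exits,
    # even if an assertion fails partway through the test.
    with tempfile.NamedTemporaryFile() as f:
        trainer1.save(f.name)
        trainer2.load(f.name, model=model)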
I switched to NamedTemporaryFile; however, I had copied the tempfile.mkstemp() pattern from test_save_load() in test_multitask_classifier.py. Maybe you want to update it there, too?
got it — yes, we likely need to clean up other parts of the codebase as well. :)
Parameters
----------
trainer_path :
nit: no : here. see https://github.com/snorkel-team/snorkel/blob/master/snorkel/classification/multitask_classifier.py for an example of docstrings
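For reference, a sketch of the requested numpy-style format; the load() signature is assumed from the PR description, not copied from the linked file:

    def load(self, trainer_path, model=None):
        """Load the trainer config and optimizer state saved at trainer_path.

        Parameters
        ----------
        trainer_path
            The path to the saved trainer config to be loaded
        model
            MultitaskClassifier for which the optimizer has been set
        """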
@@ -216,6 +216,35 @@ def test_warmup(self):
        trainer.fit(model, [dataloaders[0]])
        self.assertEqual(trainer.warmup_steps, 1)

    def test_save_load(self):
have you tested this with resuming training from a saved checkpoint?
yes, I have included it in the test now
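A rough sketch of what such a resume test might look like; Trainer, model, and dataloaders are assumed to be set up as in the rest of test_trainer.py, and the save()/load() signatures follow this PR's description:

    def test_resume_training(self):
        with tempfile.NamedTemporaryFile() as f:
            trainer1 = Trainer(n_epochs=1)
            trainer1.fit(model, [dataloaders[0]])
            trainer1.save(f.name)

            # Load the checkpoint into a fresh trainer and keep fitting,
            # which exercises the restored optimizer state.
            trainer2 = Trainer(n_epochs=1)
            trainer2.load(f.name, model=model)
            trainer2.fit(model, [dataloaders[0]])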
trainer_path :
    The path to the saved trainer config to be loaded
model :
    MultitaskClassifier for which the optimizer has been set. Parameters of
    optimizer must fit to model parameters. This model shall be the model
    which was fit by the stored Trainer.
nit: no : for parameters (see above)
fantastic, lgtm! thank you for the contribution!!
Description of proposed changes
save() and load() methods for Trainer that serialize the optimizer state + trainer config (dependent on whether a model has been fitted with that Trainer instance)
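A hypothetical usage sketch of the new methods; the model and dataloader names are placeholders, and the exact signatures follow this PR's discussion rather than a released API:

    trainer = Trainer(n_epochs=1)
    trainer.fit(model, [train_dataloader])
    trainer.save("trainer_checkpoint.pt")  # persists optimizer state + config

    # Later: restore the optimizer state for the same model and resume training.
    new_trainer = Trainer()
    new_trainer.load("trainer_checkpoint.pt", model=model)
    new_trainer.fit(model, [train_dataloader])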
Related issue(s)
Fixes #1416
Test plan
Adapted test_save_load() in test_trainer.py.
Checklist
Need help on these? Just ask!
tox -e complex and/or tox -e spark if appropriate.