
Version 1.0 #639

Merged: 64 commits merged into main from version-1.0 on Mar 6, 2023

Conversation

NickleDave (Collaborator)

Going to go ahead and merge this into main so we can

  • release an alpha
  • develop it directly, instead of having a long-running version-1.0 branch that we juggle

I made a version 0.8 branch and will do any maintenance work on that until 1.0 is ready for release.

Change `lbl_tb2labels` to an instance of
`transforms.labeled_timebins.ToLabels`.
Fix needed after rebasing version 1.0 on #621
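
For context, here is a minimal sketch of the pattern this commit adopts: a callable transform class replaces the `lbl_tb2labels` function. The class body is illustrative, not vak's exact implementation; only the names `lbl_tb2labels` and `ToLabels` come from the commit.

```python
import numpy as np

class ToLabels:
    """Illustrative stand-in for transforms.labeled_timebins.ToLabels:
    maps a vector of labeled timebins to a string of segment labels."""

    def __init__(self, labelmap: dict):
        # invert the (label -> integer class) mapping once, at init
        self.labelmap_inv = {v: k for k, v in labelmap.items()}

    def __call__(self, lbl_tb: np.ndarray) -> str:
        # keep the first timebin of each run of identical labels
        keep = np.concatenate(([True], lbl_tb[1:] != lbl_tb[:-1]))
        return "".join(self.labelmap_inv[int(v)] for v in lbl_tb[keep])

# before: labels = lbl_tb2labels(lbl_tb, labelmap)
# after: instantiate once, then call like a function
to_labels = ToLabels(labelmap={"a": 0, "b": 1})
labels = to_labels(np.array([0, 0, 1, 1, 0]))  # -> "aba"
```
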
Remove entry points, since they are basically unused
by anyone but us, but require maintenance, tests, etc.
(A sketch of the registry-style replacement follows the list below.)

Squash commits:
- Remove vak/entry_points.py
- Remove import of entry_points from vak/__init__.py
- Remove model and metric entry points from pyproject.toml
- Rewrite models/models.py to not use entry points
- Fix use of models.find in config/validators.py
- Fix use of find in config/models.py
- Remove entry point test in
  tests/test_models/test_teenytweetynet.py
- Fix entry point test in tests/test_models/test_tweetynet.py
  to hopefully get full log of failures -- not seeing them now
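
As a rough sketch of what "not use entry points" means here: model discovery via `importlib.metadata` entry points is replaced by an explicit in-module mapping that `models.find` consults. The mapping name and class bodies below are illustrative stand-ins, not the exact code.

```python
# models/models.py, illustrative sketch: an explicit dict of built-in
# models replaces discovery via importlib.metadata entry points.

class TweetyNet: ...        # stand-ins for the real model classes
class TeenyTweetyNet: ...

MODELS = {cls.__name__: cls for cls in (TweetyNet, TeenyTweetyNet)}

def find(name: str):
    """Look up a model class by name (as config/validators.py does)."""
    try:
        return MODELS[name]
    except KeyError as e:
        raise ValueError(
            f"unknown model {name!r}; known models: {list(MODELS)}"
        ) from e
```
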
Fix two unit tests to not look for TweetyNet datasets,
so they don't crash on CI

- Fix tests/test_models/test_base.py unit test that tests
  loading state dict from path, by using `model` fixture
  so it only runs with teenytweetynet.
- Fix tests/test_models/test_windowed_frame_classification_model.py
  unit test that tests definitions, by adding a MOCK_INPUT_SHAPE
  so we don't actually need to load a dataset to test (sketched below).
  We really only used the dataset to get its input shape.
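
A hedged sketch of the MOCK_INPUT_SHAPE idea, with a stand-in network and made-up shape values; only the constant's name comes from the commit message:

```python
import torch

# made-up (channels, frequency bins, time bins); the real test would
# use whatever shape the model definition expects
MOCK_INPUT_SHAPE = torch.Size([1, 513, 88])

class StandInNet(torch.nn.Module):
    """Stand-in for a model definition's network."""
    def __init__(self, num_input_channels: int):
        super().__init__()
        self.conv = torch.nn.Conv2d(num_input_channels, 8,
                                    kernel_size=3, padding=1)

    def forward(self, x):
        return self.conv(x)

def test_definition():
    # no dataset loading: the shape is all we ever used it for
    net = StandInNet(num_input_channels=MOCK_INPUT_SHAPE[0])
    batch = torch.rand(2, *MOCK_INPUT_SHAPE)
    assert net(batch).shape[0] == 2
```
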
> The config allows instantiating multiple models
  for training / prediction but this was never fully implemented

Squash commits:
- Make config just use "model" option, not "models":
  - Remove `comma_separated_list` from converters
  - Change option name 'models' -> 'model' in config/valid.toml
  - Rewrite is_valid_model_name to test a single string,
    not a list of strings
  - Change attribute `models` -> `model` in config/eval.py
  - Change attribute `models` -> `model` in config/learncurve.py
  - Change attribute `models` -> `model` in config/predict.py
  - Change attribute `models` -> `model` in config/train.py
  - Rewrite/rename config.models -> model.config_from_toml_path,
    model.config_from_toml_dict
  - Fix option 'models' -> 'model', with a single string value,
    in all .toml files in tests/data_for_tests/configs
- Rewrite `models.from_model_config_map` as `models.get`
  (see the sketch after this list):
  - Add src/vak/models/_api.py with BUILTIN_MODELS and
    MODEL_NAMES to use for validation in `models.get`
  - Rewrite models/models.py `from_model_config_map` as `models.get`
  - Import get and _api in vak/models/__init__.py
  - Rewrite core/train.py to take model_name and model_config
    then use models.get
  - Fix cli/train.py to pass model_name and model_config into core.train
  - Rewrite core/eval.py to take model_name and model_config
    then use models.get
  - Fix cli/eval.py to pass model_name and model_config into core.eval
  - Rewrite core/learncurve.py to take model_name and model_config
  - Fix cli/learncurve.py to pass model_name and model_config
    into core.learncurve
  - Rewrite core/predict.py to take model_name and model_config
    then use models.get
  - Fix cli/predict.py to pass model_name and model_config
    into core.predict
  - Make 'model' not 'models' required in src/vak/config/parse.py
  - Use models.MODEL_NAMES in src/vak/config/validators.py
  - Use models.MODEL_NAMES in config/model.py
- Fix tests
  - Fix tests to use vak.config.model.config_from_toml_path:
    - tests/test_models/test_windowed_frame_classification_model.py
    - tests/test_core/test_train.py
    - tests/test_core/test_predict.py
    - tests/test_core/test_learncurve.py
    - tests/test_core/test_eval.py
  - Fix test to use 'model' option not 'models' in
    tests/test_config/test_parse.py
  - Fix assert helper function in tests/test_core
    - test_eval.py
    - test_learncurve.py
    - test_predict.py
    - test_prep.py
    - test_train.py
  - Rewrite fixture module with constants we can import
    in test modules to parametrize: tests/fixtures/config.py
  - Add tests/test_config/test_model.py
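
A minimal sketch of the `models.get` API these commits describe, assuming each model class has a `from_config` classmethod; the registry contents and config shape are illustrative:

```python
class TweetyNet:
    """Stand-in for a built-in model class."""
    @classmethod
    def from_config(cls, config: dict):
        return cls()

# mirrors src/vak/models/_api.py: a registry plus names for validation
BUILTIN_MODELS = {"TweetyNet": TweetyNet}
MODEL_NAMES = list(BUILTIN_MODELS.keys())

def get(model_name: str, model_config: dict):
    """Replaces from_model_config_map: build a single model by name."""
    if model_name not in MODEL_NAMES:
        raise ValueError(
            f"invalid model name: {model_name!r}; must be one of {MODEL_NAMES}"
        )
    return BUILTIN_MODELS[model_name].from_config(model_config)

# core/train.py, core/eval.py, etc. now receive model_name and
# model_config from the cli layer and call:
model = get(model_name="TweetyNet", model_config={})
```
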
After rebasing on top of main, where we added
the ability to run eval with and without
post-processing, we lost that ability in version 1.0,
because the transform was not passed in
to the new model abstraction / backend,
even though everything is in place to do so.
This fixes that.

- Add post_tfm parameter to models.get
  - Fix docstring to say PostProcess
  - Pass post_tfm into Model.from_config
- Fix post_tfm_kwargs definition in EvalConfig docstring
  in src/vak/config/eval.py to say PostProcess transform
- In core/eval.py, fix post_tfm_kwargs docstring
  and actually pass post_tfm into models.get
- Fix validation_step method of WindowedFrameClassificationModel
  to match what engine.Model._eval does with post_tfm in version 0.x
  (sketched below)
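
A hedged sketch of the validation_step change, under the assumption that each metric is computed once on raw predictions and again after post-processing when a post_tfm is present; batch keys, attribute names, and returning a dict instead of logging are all illustrative:

```python
import torch

class WindowedFrameClassificationModelSketch(torch.nn.Module):
    """Stand-in showing only how validation_step uses post_tfm."""

    def __init__(self, network, metrics: dict, post_tfm=None):
        super().__init__()
        self.network = network
        self.metrics = metrics    # maps metric name -> callable(pred, target)
        self.post_tfm = post_tfm  # e.g. a PostProcess transform, or None

    def validation_step(self, batch: dict) -> dict:
        x, y = batch["source"], batch["annot"]  # assumed batch keys
        y_pred = self.network(x).argmax(dim=1)
        logged = {}
        for name, metric in self.metrics.items():
            logged[f"val_{name}"] = metric(y_pred, y)
            if self.post_tfm is not None:
                # compute the same metric after post-processing,
                # as engine.Model._eval did in version 0.x
                logged[f"val_{name}_tfm"] = metric(self.post_tfm(y_pred), y)
        return logged
```
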
Multiprocessing fails when it tries to pickle
instances of a subclass made with the
`vak.models.model` decorator, i.e. TweetyNet
and TeenyTweetyNet.

This is because we were giving all attributes
from the family we were subclassing to the subclass,
including `__module__`.
So the subclass would appear to have the `__module__`
of the superclass, and when pickle tried to
do attribute lookup on that module, it would fail.

The fix is to directly set `subclass.__module__`
to `definition.__module__` after making the subclass
with `type`. After doing so, multiprocessing works.
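
A minimal runnable sketch of the fix; the decorator body here is reduced to the relevant line, and the real `vak.models.model` does much more:

```python
import pickle

class WindowedFrameClassificationModel:
    """Stand-in for the model family being subclassed."""

def model(definition):
    """Illustrative decorator, like vak.models.model."""
    subclass = type(definition.__name__,
                    (WindowedFrameClassificationModel,), {})
    # the fix: without this line the subclass can report the family's
    # __module__, and pickle's attribute lookup in that module fails
    subclass.__module__ = definition.__module__
    return subclass

@model
class TweetyNet:
    pass

pickle.dumps(TweetyNet())  # works: pickle finds TweetyNet in this module
```
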
Default for `post_tfm_kwargs` should be None, not an empty dict.
Setting it to an empty dict causes logic in core/eval
to instantiate a PostProcess transform
with majority_vote = False and min_segment_dur = None,
so that we end up applying post-processing that does nothing,
and the computed metric is the same as
without "post-processing".
This sets the default to None,
which fixes the issue.
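
A small sketch of the logic in question, with a stand-in PostProcess and a hypothetical helper name; it shows why a default of `{}` built a do-nothing transform while `None` skips post-processing entirely:

```python
class PostProcess:
    """Stand-in for the real post-processing transform."""
    def __init__(self, majority_vote=False, min_segment_dur=None):
        self.majority_vote = majority_vote
        self.min_segment_dur = min_segment_dur

def make_post_tfm(post_tfm_kwargs=None):  # fixed default: None, not {}
    # with the old default of {}, this test was always True, so eval
    # built PostProcess(majority_vote=False, min_segment_dur=None),
    # i.e. post-processing that does nothing
    if post_tfm_kwargs is not None:
        return PostProcess(**post_tfm_kwargs)
    return None

assert make_post_tfm() is None            # no kwargs -> no post-processing
assert make_post_tfm({"majority_vote": True}).majority_vote
```
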
Rename `csv_path` to `dataset_path`,
because `csv_path` is not very specific, and because
the path to a dataset may not always be
the path to a csv (although it is for now).

- Rename `csv_path` -> `dataset_path` in vak/config
- Rename `csv_path` -> `dataset_path` in vak/core
- Rename `csv_path` -> `dataset_path` in vak/cli
- Rename `csv_path` -> `dataset_path` in tests
@NickleDave NickleDave merged commit 877e55b into main Mar 6, 2023
@NickleDave NickleDave deleted the version-1.0 branch March 8, 2023 15:49