Fix warning from torch.load starting in torch 2.4 #1064

Merged: 5 commits merged into master from fix-torch-load-warning-weights-only on Sep 19, 2024

Conversation

BenjaminBossan (Collaborator)

See discussion in #1063

Starting with PyTorch 2.4, torch.load emits a warning when it is called without setting the weights_only argument. The reason is that the default will switch from False to True in a future release, which can result in a lot of errors when trying to load torch files (they are pickle files and thus insecure to load without restrictions).
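
To illustrate the warning, a minimal sketch (the file name model.pt is a placeholder):

import torch

# On torch >= 2.4, this emits a FutureWarning because weights_only is not set;
# the current default, weights_only=False, unpickles arbitrary objects.
state = torch.load('model.pt')

# Setting the argument explicitly avoids the warning; weights_only=True
# restricts loading to tensors and other allowlisted types.
state = torch.load('model.pt', weights_only=True)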

In this PR, we give the user a way to influence the kwargs passed to torch.load so that they can control that behavior (see the sketch below). If the user does not indicate otherwise, we use the same defaults as the installed torch version. Therefore, users will only encounter this issue via skorch if they would have encountered it via torch anyway.
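
Roughly, usage could look like the following sketch; the torch_load_kwargs argument is the one added in this PR, while the module and file name are made up for illustration:

import torch
from skorch import NeuralNetClassifier

class MyModule(torch.nn.Module):
    # minimal placeholder module
    def __init__(self):
        super().__init__()
        self.lin = torch.nn.Linear(20, 2)

    def forward(self, X):
        return torch.softmax(self.lin(X), dim=-1)

net = NeuralNetClassifier(
    MyModule,
    # opt in to the stricter behavior regardless of the installed torch version
    torch_load_kwargs={'weights_only': True},
)
net.initialize()
net.load_params(f_params='params.pt')  # kwargs are forwarded to torch.load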

Since it's not 100% certain whether the default will switch in torch 2.6.0, we may have to adjust the version check in the future.
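
A sketch of how such a version-gated default could be determined; this illustrates the approach, not necessarily the exact code from the PR:

import torch

def get_default_torch_load_kwargs():
    # Return torch.load kwargs that match the installed torch's defaults.
    # Assumption (see above): the default of weights_only flips to True in
    # torch 2.6.0; if that changes, this version check must be adjusted.
    major, minor = (int(p) for p in torch.__version__.split('.')[:2])
    if (major, minor) >= (2, 6):
        return {'weights_only': True}
    return {'weights_only': False}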

Besides directly testing the kwargs being passed on, a test was also added to check that net.load_params does not emit any warnings. This is already indirectly covered by some accelerate tests that are currently failing with torch 2.4, but it's better to have an explicit test.
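
As a sketch, such an explicit test could look like this (pytest style; net_fitted is a hypothetical fixture providing a fitted skorch net):

import warnings

def test_load_params_no_warning(net_fitted, tmp_path):
    path = str(tmp_path / 'params.pt')
    net_fitted.save_params(f_params=path)
    with warnings.catch_warnings():
        warnings.simplefilter('error')  # escalate any warning to an error
        net_fitted.load_params(f_params=path)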

After this is merged, the CI should pass when using torch 2.4.0.

skorch/utils.py Outdated
@@ -768,3 +769,16 @@ def _check_f_arguments(caller_name, **kwargs):
key = 'module_' if key == 'f_params' else key[2:] + '_'
kwargs_module[key] = val
return kwargs_module, kwargs_other


def check_torch_weights_only_default_true():
Member

Given how specific this function is to torch.load, can this return torch_load_kwargs itself?

Collaborator Author

Good point, I made the suggested change.

skorch/utils.py Outdated


def get_torch_load_kwargs():
"""Returns the kwargs passed to torch.load the correspond to the current
Member

Suggested change
- """Returns the kwargs passed to torch.load the correspond to the current
+ """Returns the kwargs passed to torch.load that correspond to the current

skorch/utils.py Outdated
@@ -768,3 +769,18 @@ def _check_f_arguments(caller_name, **kwargs):
key = 'module_' if key == 'f_params' else key[2:] + '_'
kwargs_module[key] = val
return kwargs_module, kwargs_other


def get_torch_load_kwargs():
Member

Suggested change
- def get_torch_load_kwargs():
+ def get_default_torch_load_kwargs():

skorch/net.py Outdated
@@ -2620,10 +2650,14 @@ def _get_state_dict(f_name):

return state_dict
else:
torch_load_kwargs = self.torch_load_kwargs
if torch_load_kwargs is None:
torch_load_kwargs = get_torch_load_kwargs()
Member

Suggested change
- torch_load_kwargs = get_torch_load_kwargs()
+ torch_load_kwargs = get_default_torch_load_kwargs()

skorch/tests/test_net.py (review comment resolved)
@BenjaminBossan (Collaborator Author)

CI is failing for unrelated reasons since the latest accelerate release, I opened an issue about it:

huggingface/accelerate#3070

@byi8220 commented Sep 3, 2024

Quick question about the (unrelated) failing CI: are the CI and integration tests run on multi-GPU environments at all?

@BenjaminBossan (Collaborator Author)

Quick question about the (unrelated) failing CI: are the CI and integration tests run on multi-GPU environments at all?

No, we're only using the free runners from GitHub on this repo. Is there anything that we should check specifically on GPU?

@byi8220 commented Sep 4, 2024

Is there anything that we should check specifically on GPU?

Not sure. I think the only way GPU training would affect pickling is in distributed setups. I'm actually not sure how reliable pickling a running distributed accelerator is (e.g. there are a lot of Stack Overflow and forum posts about running into issues with pickling generators or in a multiprocessing context).

@BenjaminBossan (Collaborator Author)

If such a setting causes trouble, it's probably not just because of the accelerator, so I think we can disregard that for now.

@BenjaminBossan (Collaborator Author)

@ottonemo have your points been addressed?

@ottonemo merged commit e724424 into master on Sep 19, 2024
15 checks passed
@BenjaminBossan deleted the fix-torch-load-warning-weights-only branch on September 19, 2024 at 13:52