[fx_importer] Convert non-persistent buffers lifted as tensor constants #2902

sjain-stanford · 2024-02-13T13:53:40Z

The investigation is largely recorded in #2881. This change lets us capture non-persistent buffers that are lifted as tensor constants (since pytorch/pytorch#118969 landed in upstream PyTorch) and propagate them to the Torch dialect as "frozen" torch.vtensor.literal ops. I believe this patch works with both nightly and stable PyTorch, but will let CI confirm. Thanks @stellaraccident for the valuable pointers and guidance.

stellaraccident

If the CI passes, this lgtm and can land. Once it does, let's close the original PR.

stellaraccident · 2024-02-13T19:58:34Z

You'll need a hasattr(..., "constants") check to guard the new clause. Without it, this fails with: "AttributeError: 'ExportedProgram' object has no attribute 'constants'"

If doing that, I would probably make it fully conditional with a note about where the change happened: If there is no "constants" attribute, consult the "state_dict". Otherwise, only look at "constants". Future us will thank us because the freezing of states will be something we want to undo eventually (just like the upstream patch had to fix this).
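A minimal sketch of the conditional described above, for illustration only. It is not the code that landed in this PR: `resolve_buffer_source`, `old_prog`, and `new_prog` are hypothetical names, and the stand-in objects only mimic the two attribute layouts mentioned in the review (with and without `constants`) rather than being real `ExportedProgram` instances.

```python
from types import SimpleNamespace
from typing import Any, Mapping


def resolve_buffer_source(prog: Any) -> Mapping[str, Any]:
    """Pick the mapping that lifted buffer values are read from.

    Assumption: `prog` looks like an exported program that always has
    `state_dict` and, on newer PyTorch builds, also has `constants`.
    """
    if hasattr(prog, "constants"):
        # `constants` exists: look only there, never at `state_dict`.
        return prog.constants
    # No `constants` attribute: fall back to `state_dict`.
    return prog.state_dict


# Hypothetical stand-ins for the two layouts (plain lists instead of tensors).
old_prog = SimpleNamespace(state_dict={"buf": [1.0, 2.0]})
new_prog = SimpleNamespace(
    state_dict={"weight": [0.5]},
    constants={"buf": [1.0, 2.0]},
)

old_source = resolve_buffer_source(old_prog)
new_source = resolve_buffer_source(new_prog)
```

Keeping the two branches mutually exclusive, instead of merging both mappings, is what prevents entries that exist only in `state_dict` from being frozen as literals when `constants` is available.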

sjain-stanford · 2024-02-13T20:12:35Z

> If doing that, I would probably make it fully conditional with a note about where the change happened: If there is no "constants" attribute, consult the "state_dict". Otherwise, only look at "constants". Future us will thank us because the freezing of states will be something we want to undo eventually (just like the upstream patch had to fix this).

Done. Just a clarification that we will need to continue using state_dict for parameters (irrespective of how buffers are handled). Is that correct?

stellaraccident · 2024-02-13T20:38:29Z

> > If doing that, I would probably make it fully conditional with a note about where the change happened: If there is no "constants" attribute, consult the "state_dict". Otherwise, only look at "constants". Future us will thank us because the freezing of states will be something we want to undo eventually (just like the upstream patch had to fix this).
>
> Done. Just a clarification that we will need to continue using state_dict for parameters (irrespective of how buffers are handled). Is that correct?

I think the way you have this now puts us in a safe state. If we were unconditionally looking at state_dict, we would likely be lifting things that future-us actually wants to treat as mutable, model-level things. Many systems are still figuring out what to do about that, but better to have this do the right thing at the front door for now.

vivekkhandelwal1 and others added 2 commits February 8, 2024 15:03

build: manually update PyTorch version

3556065

Set PyTorch and TorchVision version to nightly release 2024-02-07. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

bring basic_test.py up to current HEAD

a453907

sjain-stanford requested review from vivekkhandelwal1 and stellaraccident February 13, 2024 13:53

sjain-stanford closed this Feb 13, 2024

fixes to adapt to pytorch/pytorch#118969

f9d63a7

sjain-stanford changed the title ~~Bump pytorch nightly~~ [fx_importer] Fixes to convert non-persistent buffers lifted as tensor constants Feb 13, 2024

sjain-stanford reopened this Feb 13, 2024

sjain-stanford mentioned this pull request Feb 13, 2024

build: manually update PyTorch version #2881

Closed

stellaraccident approved these changes Feb 13, 2024

sjain-stanford changed the title ~~[fx_importer] Fixes to convert non-persistent buffers lifted as tensor constants~~ [fx_importer] Convert non-persistent buffers lifted as tensor constants Feb 13, 2024

make the constants vs state_dict for buffers conditional

06a893e

stellaraccident merged commit 3e836d8 into llvm:main Feb 13, 2024
3 checks passed

sjain-stanford deleted the roll-pytorch branch February 13, 2024 20:43
