
PyTorch inline constants in dispatch to avoid graph breaks #1118

Merged
merged 2 commits into pymc-devs:main from torch_constant_dispatch on Feb 10, 2025

Conversation

ricardoV94
Member

@ricardoV94 ricardoV94 commented Dec 12, 2024

When we have static inputs, inlining them helps torch avoid breaking the graph.

Related to #1110


📚 Documentation preview 📚: https://pytensor--1118.org.readthedocs.build/en/1118/
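
For illustration, a minimal, hypothetical sketch of the idea (not PyTensor's actual dispatch code): a constant captured from the enclosing scope gets baked into the traced graph, whereas passing it as yet another runtime argument gives dynamo one more input to track.

```python
import torch

const = torch.tensor([1.0, 2.0, 3.0])  # stands in for a static/constant input


@torch.compile
def inlined(x):
    # `const` is closed over, so torch.compile can specialize on it
    return x * const


@torch.compile
def passed_in(x, c):
    # `c` is a regular input; dynamo installs guards on its properties
    return x * c


x = torch.ones(3)
inlined(x)
passed_in(x, const)
```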

@ricardoV94 ricardoV94 added performance torch PyTorch backend labels Dec 12, 2024
@ricardoV94
Member Author

Still need to do something about the runtime broadcast check in Elemwise. Can we use torch._check for that instead of Python loops/asserts?
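
A rough, hypothetical sketch of what a torch._check-based version of that check could look like (the helper name and exact condition are assumptions, not code from this PR): torch._check raises on failure but stays visible to the compiler, unlike a plain Python assert.

```python
import torch


def check_runtime_broadcast(tensor, out_shape, declared_broadcastable):
    for size, out_size, bcastable in zip(tensor.shape, out_shape, declared_broadcastable):
        if not bcastable:
            # broadcasting a length-1 dim that was not declared broadcastable
            # is what PyTensor's runtime check forbids
            torch._check(
                size != 1 or out_size == 1,
                lambda: "Runtime broadcasting not allowed",
            )
```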


codecov bot commented Dec 12, 2024

Codecov Report

Attention: Patch coverage is 68.88889% with 14 lines in your changes missing coverage. Please review.

Project coverage is 82.27%. Comparing base (4ea4259) to head (75eef40).
Report is 18 commits behind head on main.

Files with missing lines | Patch % | Lines
pytensor/link/pytorch/dispatch/basic.py | 52.17% | 9 Missing and 2 partials ⚠️
pytensor/link/pytorch/dispatch/scalar.py | 66.66% | 1 Missing ⚠️
pytensor/link/pytorch/dispatch/shape.py | 90.00% | 1 Missing ⚠️
pytensor/link/pytorch/dispatch/subtensor.py | 87.50% | 1 Missing ⚠️

❌ Your patch status has failed because the patch coverage (68.88%) is below the target coverage (100.00%). You can increase the patch coverage or adjust the target coverage.

Additional details and impacted files


@@           Coverage Diff           @@
##             main    #1118   +/-   ##
=======================================
  Coverage   82.27%   82.27%           
=======================================
  Files         186      186           
  Lines       48000    48066   +66     
  Branches     8621     8633   +12     
=======================================
+ Hits        39490    39546   +56     
- Misses       6353     6360    +7     
- Partials     2157     2160    +3     
Files with missing lines | Coverage Δ
pytensor/link/pytorch/linker.py | 100.00% <100.00%> (ø)
pytensor/link/pytorch/dispatch/scalar.py | 73.68% <66.66%> (-0.39%) ⬇️
pytensor/link/pytorch/dispatch/shape.py | 85.71% <90.00%> (ø)
pytensor/link/pytorch/dispatch/subtensor.py | 89.53% <87.50%> (-0.21%) ⬇️
pytensor/link/pytorch/dispatch/basic.py | 87.40% <52.17%> (-7.10%) ⬇️

... and 13 files with indirect coverage changes

@ricardoV94
Member Author

Even without the runtime broadcast check, Elemwise seems to break the graph.

@ricardoV94 ricardoV94 force-pushed the torch_constant_dispatch branch from c08d288 to 566145a Compare December 12, 2024 10:48
@Ch0ronomato
Contributor

Did you get a chance to profile this PR?

@Ch0ronomato
Contributor

Btw, I did profile this. My machine actually failed to even compile dlogp for a model, but I suspect that's unrelated. The logp method did show some improvement. What intrigued me is that this change reduced the number of guards by a lot (roughly 10:1 compared with the other branches). I thought that might be the cause of the runtime difference, but it didn't have the payoff I was expecting.

@ricardoV94
Member Author

The cost of the guards may be non-linear, so we should try to remove them all.

@Ch0ronomato
Contributor

> The cost of the guards may be non-linear, so we should try to remove them all.

Idk about removing them all, since guards are the primitive that ensures runtime correctness. Significantly reducing them, I agree.
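
As an aside, a hypothetical way to count graphs and graph breaks for a compiled function (the toy `fn` below is illustrative only); guard logs can additionally be enabled with the TORCH_LOGS="guards" environment variable.

```python
import torch


def fn(x):
    # data-dependent control flow like this is a classic cause of a graph break
    if x.sum() > 0:
        return x + 1
    return x - 1


explanation = torch._dynamo.explain(fn)(torch.ones(3))
print(explanation.graph_count, explanation.graph_break_count)
for reason in explanation.break_reasons:
    print(reason)
```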

@Ch0ronomato
Contributor

Btw, for the actual perf benefit, these are the numbers I see.

# ricardo shape: 772 μs ± 12 μs per loop (mean ± std. dev. of 100 runs, 100 loops each)
# no ricardo shape: 818 μs ± 9.48 μs per loop (mean ± std. dev. of 100 runs, 100 loops each)

So it's like ~5%, probably more on slower CPUs. The graph breaks are definitely a problem :(

@Ch0ronomato
Contributor

Ch0ronomato commented Dec 27, 2024

If we add these two flags with the changes in this PR:

torch._dynamo.config.capture_func_transforms = True
torch._dynamo.config.capture_scalar_outputs = True

we come down to about 500 µs:

504 μs ± 12.2 μs per loop (mean ± std. dev. of 100 runs, 100 loops each)
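
Rough sketch of the configuration being benchmarked here, applied to a toy compiled function (`compiled_fn` is illustrative, not the PyTensor linker):

```python
import torch

torch._dynamo.config.capture_func_transforms = True
torch._dynamo.config.capture_scalar_outputs = True


@torch.compile
def compiled_fn(x):
    # .item() returns a Python scalar; with capture_scalar_outputs=True dynamo
    # can trace through it instead of falling back with a graph break
    return x * x.sum().item()


compiled_fn(torch.ones(3))
```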

@ricardoV94
Member Author

@Ch0ronomato can you revert the removal of the Elemwise bcast check (for now), and add those flags? Then we can merge this PR and keep playing with stuff

@Ch0ronomato
Contributor

The CI doesn't like those flags. I'll investigate.

@Ch0ronomato
Contributor

I think the path to fix this is not to use those flags by default, but only when we have a shape operation. The torch compiler might be really restrictive.
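
A hypothetical sketch of that suggestion, only flipping the dynamo flags when the PyTensor graph actually contains shape ops (illustrative, not the merged code):

```python
from pytensor.tensor.shape import Shape, Shape_i


def needs_dynamic_shape_flags(fgraph) -> bool:
    # enable the dynamo config flags only if the compiled FunctionGraph
    # contains shape-related ops
    return any(isinstance(node.op, (Shape, Shape_i)) for node in fgraph.apply_nodes)
```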

@Ch0ronomato Ch0ronomato force-pushed the torch_constant_dispatch branch from c5f26fd to dbc95e4 Compare January 26, 2025 17:46
@@ -37,6 +37,9 @@ def conversion_func_register(*args, **kwargs):
def jit_compile(self, fn):
import torch

# flag that tends to help our graphs
torch._dynamo.config.capture_dynamic_output_shape_ops = True
Contributor


Hopefully when #1159 gets merged we can just delete this flag altogether since torch will know these aren't dynamic
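
A hypothetical example of the kind of op this flag targets: torch.nonzero has a data-dependent output shape, which dynamo otherwise handles by breaking the graph (or erroring under fullgraph=True).

```python
import torch

torch._dynamo.config.capture_dynamic_output_shape_ops = True


@torch.compile(fullgraph=True)
def f(x):
    # output shape depends on the data, not just the input shape
    return torch.nonzero(x)


f(torch.tensor([0, 1, 0, 2]))
```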

@Ch0ronomato Ch0ronomato marked this pull request as ready for review January 26, 2025 20:20
@Ch0ronomato Ch0ronomato force-pushed the torch_constant_dispatch branch from eb3ff29 to 75eef40 Compare February 10, 2025 00:39
@Ch0ronomato Ch0ronomato merged commit 4fa9bb8 into pymc-devs:main Feb 10, 2025
62 checks passed