Make generated functions safe for extension #426

willtebbutt · 2024-12-16T21:00:45Z

As discussed in #422 , my use of generated functions throughout Mooncake is somewhat unsafe, in the sense that they often use functions which I expect will have methods added to them as part of codegen (see #422 (comment) for further discussion). I discovered that this is a problem while trying to write the extensions necessary to handle GPUArrays properly. Since this is quite a pervasive issue, I need to resolve it asap in order to finish up our initial GPU support work.

It is helpful to have this working PR open in order to regularly run CI to check that nothing has broken as I work through the various fixes which are needed.

todo:

fix up remaining problematic generated functions
add new benchmark case for highly nested tuple -- turned into separate issue
resolve all perf problems

codecov · 2024-12-16T21:01:54Z

Codecov Report

Attention: Patch coverage is 89.57055% with 17 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/tangents.jl	84.31%	8 Missing ⚠️
src/fwds_rvs_data.jl	97.22%	2 Missing ⚠️
src/interpreter/s2s_reverse_mode_ad.jl	60.00%	2 Missing ⚠️
src/rrules/memory.jl	0.00%	2 Missing ⚠️
ext/MooncakeCUDAExt.jl	0.00%	1 Missing ⚠️
src/rrules/iddict.jl	0.00%	1 Missing ⚠️
src/rrules/twice_precision.jl	0.00%	1 Missing ⚠️

Files with missing lines	Coverage Δ
src/interpreter/abstract_interpretation.jl	`86.44% <100.00%> (+3.10%)`	⬆️
src/interpreter/ir_utils.jl	`87.50% <100.00%> (ø)`
src/rrules/fastmath.jl	`100.00% <ø> (ø)`
src/test_utils.jl	`93.10% <100.00%> (+0.08%)`	⬆️
src/utils.jl	`87.09% <100.00%> (+0.88%)`	⬆️
ext/MooncakeCUDAExt.jl	`88.00% <0.00%> (-8.00%)`	⬇️
src/rrules/iddict.jl	`4.08% <0.00%> (-93.88%)`	⬇️
src/rrules/twice_precision.jl	`96.92% <0.00%> (-0.77%)`	⬇️
src/fwds_rvs_data.jl	`96.66% <97.22%> (+0.13%)`	⬆️
src/interpreter/s2s_reverse_mode_ad.jl	`94.28% <60.00%> (-0.78%)`	⬇️
... and 2 more

... and 7 files with indirect coverage changes

github-actions · 2024-12-16T21:21:10Z

Performance Ratio:
Ratio of time to compute gradient and time to compute function.
Warning: results are very approximate! See here for more context.

┌────────────────────────────┬──────────┬─────────┬─────────────┬─────────┐
│                      Label │ Mooncake │  Zygote │ ReverseDiff │  Enzyme │
│                     String │   String │  String │      String │  String │
├────────────────────────────┼──────────┼─────────┼─────────────┼─────────┤
│                   sum_1000 │     86.1 │     1.0 │        5.61 │    8.21 │
│                  _sum_1000 │     6.65 │  1450.0 │        33.5 │    1.09 │
│               sum_sin_1000 │     2.28 │    1.71 │        11.0 │     2.0 │
│              _sum_sin_1000 │     2.55 │   250.0 │        13.1 │    2.33 │
│                   kron_sum │     63.8 │    3.58 │       205.0 │    9.67 │
│              kron_view_sum │     21.4 │    3.36 │        82.9 │    36.1 │
│      naive_map_sin_cos_exp │      2.5 │ missing │        7.47 │    2.32 │
│            map_sin_cos_exp │     2.81 │    1.53 │         6.1 │    2.89 │
│      broadcast_sin_cos_exp │     2.56 │    2.25 │        1.47 │    2.26 │
│                 simple_mlp │     7.91 │    3.19 │        12.0 │    3.72 │
│                     gp_lml │     4.63 │    3.62 │     missing │    2.16 │
│ turing_broadcast_benchmark │     3.17 │ missing │        25.6 │ missing │
│         large_single_block │     4.42 │  3990.0 │        29.7 │    2.18 │
└────────────────────────────┴──────────┴─────────┴─────────────┴─────────┘

willtebbutt · 2024-12-17T09:59:30Z

Note: current performance-related test failures are not replicated when running without the flags used on the runners (I'm not seeing the performance issues locally). I'll need to figure out how to fix this...

Not true: I'm now seeing them locally.

yebai · 2024-12-17T11:23:07Z

@willtebbutt, can we add a new benchmark test case based on

Mooncake.jl/src/tangents.jl

Lines 1057 to 1059 in 8e7ee73

    
           # Regression tests to catch type inference failures, see https://github.com/compintell/Mooncake.jl/pull/422 
        
           (((((randn(33)...,),),),),), 
        
           (((((((((randn(33)...,),),),),), randn(5)...),),),),

? I understand that the regression tests should be able to catch type inference failures, but an extra benchmark case would help us to track the performance variations across PRs, which I am curious to see.

This reverts commit 824acd0.

This reverts commit 2997c78.

This reverts commit 9a427e7.

This reverts commit 931e27f.

This reverts commit 96ff8b9.

…m/compintell/Mooncake.jl into wct/more-safe-generated-functions

willtebbutt added 2 commits December 16, 2024 20:55

Tidy up comments in code

613643e

Remove redundant method of backing_type

7646f38

willtebbutt marked this pull request as draft December 16, 2024 21:00

Formatting

829f0af

willtebbutt added 4 commits December 16, 2024 21:21

Make tangent type not generated and tidy up a bit

e396f8b

Typo

b8bbecd

stable_ntuple

07158f8

Remove more generated functions

76e049b

willtebbutt added 6 commits December 17, 2024 10:00

make safe __make_ref generated function

cc03e15

Formatting

7a02be9

temporarily revert change

6c705d2

Add extra perf tests to fwds_rvs data

28bd919

Fix perf problem

918df57

Fix battery tests

eecc483

willtebbutt added 12 commits December 17, 2024 11:28

Fix up perf

a809bd3

Start making fdata and rdata functions safe

727b90b

Remove redundant generated

337586a

Make tuple fdata rdata safe

44826c5

Fix all on 1.10

9fecea7

Formatting

f22068b

Fix performance

824acd0

Stop using generated function

9a427e7

Revert "Fix performance"

2997c78

This reverts commit 824acd0.

Revert "Revert "Fix performance""

4067f52

This reverts commit 2997c78.

Revert "Stop using generated function"

44da086

This reverts commit 9a427e7.

Unrevert change to generated function

46666ab

willtebbutt added 13 commits December 21, 2024 00:26

Generated function to enforce specialisation

529c42d

Try disabling debug mode on bulk of tests

0247389

Remove entirely redundant stable_ntuple function

5badf8c

Test overall performance with uninferred tangent type

931e27f

Revert "Test overall performance with uninferred tangent type"

5541ae6

This reverts commit 931e27f.

Remove redundant comment

f39465c

Remove compiler-level assertions

ad29312

Fix typo

69fed64

Typo

ddbbb66

Just assume effects

96ff8b9

Revert "Just assume effects"

cac2536

This reverts commit 96ff8b9.

Assume some effects

b738dad

Formatting

546b193

willtebbutt mentioned this pull request Dec 23, 2024

New Benchmark Case #432

Open

willtebbutt added 4 commits December 23, 2024 15:56

Bump patch version

25cf777

Remove redundant function

c1be9a3

Tidy up a bit

f9aff07

Tidy up further

b523f35

willtebbutt marked this pull request as ready for review December 23, 2024 16:31

willtebbutt and others added 9 commits December 23, 2024 19:00

Remove unused functionality

c60d5b2

Tidy tidy tidy

54c82ae

Enforce effects in unit tests

6f07a68

Formatting and effects macro

9c97dea

Effects for CuArray

28db2a9

More effects

e20d9e5

Assume more effects

c72cb1c

Tidy up more

5ef2463

Merge branch 'wct/more-safe-generated-functions' of https://github.co…

3ed7c15

…m/compintell/Mooncake.jl into wct/more-safe-generated-functions

willtebbutt merged commit 658d566 into main Dec 24, 2024
71 of 72 checks passed

willtebbutt deleted the wct/more-safe-generated-functions branch December 24, 2024 12:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make generated functions safe for extension #426

Make generated functions safe for extension #426

willtebbutt commented Dec 16, 2024 •

edited

Loading

codecov bot commented Dec 16, 2024 •

edited

Loading

github-actions bot commented Dec 16, 2024 •

edited

Loading

willtebbutt commented Dec 17, 2024 •

edited

Loading

yebai commented Dec 17, 2024

Make generated functions safe for extension #426

Make generated functions safe for extension #426

Conversation

willtebbutt commented Dec 16, 2024 • edited Loading

codecov bot commented Dec 16, 2024 • edited Loading

Codecov Report

github-actions bot commented Dec 16, 2024 • edited Loading

willtebbutt commented Dec 17, 2024 • edited Loading

yebai commented Dec 17, 2024

willtebbutt commented Dec 16, 2024 •

edited

Loading

codecov bot commented Dec 16, 2024 •

edited

Loading

github-actions bot commented Dec 16, 2024 •

edited

Loading

willtebbutt commented Dec 17, 2024 •

edited

Loading