Autodiff Upstreaming - rustc_codegen_ssa, rustc_middle #133429

ZuseZ4 · 2024-11-25T02:19:09Z

This PR should not be merged until the rustc_codegen_llvm part is merged.
I will also alter it a little based on what gets shaved off from the cg_llvm PR,
and address some of the feedback I received in the other PR (including cleanups).

I am putting it already up to

1) Discuss with @jieyouxu if there is more work needed to add tests to this and
2) Pray that there is someone reviewing who can tell me why some of my autodiff invocations get lost.

Re 1: My tests require fat-lto. I also modify the compilation pipeline, so if there are any other llvm-ir tests in the same compilation unit then I will likely break them. Luckily there are two groups who currently have the same fat-lto requirement for their GPU code which I have for my autodiff code, and both groups have some plans to enable support for thin-lto. Once either of those efforts pans out, I'll copy it over for this feature. I will also work on not changing the optimization pipeline for functions that are not differentiated, but that will require some thought and engineering, so I think it would be good to be able to run the autodiff tests isolated from the rest for now. Can you guide me here please?
For context, here are some of my tests in the samples folder: https://github.com/EnzymeAD/rustbook

Re 2: This is a pretty serious issue, since it effectively prevents publishing libraries making use of autodiff: EnzymeAD#173. For some reason my dummy code persists till the end, so the code which calls autodiff, deletes the dummy, and inserts the code to compute the derivative never gets executed. To me it looks like the rustc_autodiff attribute just gets dropped, but I don't know WHY. Any help would be super appreciated, as rustc queries look a bit voodoo to me.

Tracking:

Tracking Issue for autodiff #124509

r? @jieyouxu

ZuseZ4 · 2024-11-25T03:30:47Z

To expand on 2)
Assume you have the following code

#[autodiff(bar, Reverse, ...)]
fn foo(x: f32) -> f32 { x*x }

it will expand to

#[rustc_autodiff]
fn foo(x: f32) -> f32 {x*x}
#[rustc_autodiff(Reverse,...)]
fn bar(x: f32, scalar_factor: f32) -> (f32, f32) {
   // some_dummy_code()
}

Now I have some logic in this PR which picks up the rustc_autodiff attributes and passes them on to the backend, where for every single rustc_autodiff attribute with arguments we pick the function (thus bar here) and replace the dummy code with the right code to return the derivative. So bar would afterwards return (x*x, 2.0 * x). But as mentioned above, once you use autodiff in a library and call it in another module, the dummy code gets executed, as shown in the linked PR.
Any hints would be appreciated.
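
To make the intended end state concrete, here is a hand-written sketch of what bar should compute once the dummy body has been replaced. It is not the code Enzyme generates. The name bar_by_hand is hypothetical, and the reading that scalar_factor acts as a reverse-mode seed that scales the gradient (so that a factor of 1.0 yields the (x*x, 2.0 * x) mentioned above) is an assumption.

```rust
/// The primal function from the example above.
fn foo(x: f32) -> f32 {
    x * x
}

/// Hand-written stand-in for the body that autodiff should place into `bar`.
/// Assumption: `scalar_factor` is the reverse-mode seed and scales the gradient.
fn bar_by_hand(x: f32, scalar_factor: f32) -> (f32, f32) {
    let primal = foo(x);
    let gradient = 2.0 * x * scalar_factor; // d(x*x)/dx = 2x
    (primal, gradient)
}
```

If a call to bar from another crate returns whatever the dummy body produces instead of these values, the replacement step was skipped.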

jieyouxu

I have some interim feedback for this draft PR

compiler/rustc_codegen_ssa/src/back/write.rs

compiler/rustc_codegen_ssa/src/assert_module_sources.rs

compiler/rustc_codegen_ssa/src/codegen_attrs.rs

compiler/rustc_monomorphize/src/partitioning.rs

compiler/rustc_session/src/options.rs

jieyouxu · 2024-11-25T11:58:00Z

compiler/rustc_session/src/options.rs

@@ -996,6 +997,35 @@ mod parse {
        }
    }

+    pub(crate) fn parse_autodiff(slot: &mut Vec<AutoDiff>, v: Option<&str>) -> bool {


Remark: this has no error messages if the autodiff options failed to parse, acceptable for unstable flag but still poor UX, unacceptable when it comes to stabilization time.

It should be handled with a proper description in parse_autodiff above.

I assume there is some rustc function to generate suggestions when users make a typo?
I'll update it for now to print the unrecognized value.

I looked again at all the other options which accept more values (e.g. instrument_xray), and they have no error-handling either. Is there a preferred way to print errors here?

We can do this later imo.
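
As a rough illustration of "print the unrecognized value", the sketch below parses a comma-separated option list and reports the first value it does not know. It is standalone and does not use the rustc option-parsing machinery: the real parse_autodiff returns bool and fills a slot, while this version returns a Result so the message is visible. The enum AutoDiffOpt, its variant names, and the function name parse_autodiff_opts are placeholders, not the values the flag actually accepts.

```rust
/// Hypothetical option set; the variant names are placeholders, not the
/// actual values accepted by the autodiff flag.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum AutoDiffOpt {
    Enable,
    PrintSteps,
}

/// Parses a comma-separated list and reports the first unrecognized value.
fn parse_autodiff_opts(v: Option<&str>) -> Result<Vec<AutoDiffOpt>, String> {
    let Some(v) = v else {
        return Err("expected a comma-separated list of autodiff options".to_string());
    };
    v.split(',')
        .map(|raw| match raw.trim() {
            "Enable" => Ok(AutoDiffOpt::Enable),
            "PrintSteps" => Ok(AutoDiffOpt::PrintSteps),
            other => Err(format!("unrecognized autodiff option `{other}`")),
        })
        .collect()
}
```

Collecting into a Result stops at the first bad value, which keeps the message short; listing every bad value or suggesting the closest valid one would be a later refinement.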

compiler/rustc_session/src/config.rs

compiler/rustc_session/src/options.rs

jieyouxu · 2024-11-25T12:12:03Z

My test require fat-lto.

For codegen/assembly/ui tests you can make it build w/ fat LTO via

//@ compile-flags: -Clto=fat

I also modify the compilation pipeline. So if there are any other llvm-ir tests in the same compilation unit then I will likely break them.

I don't quite understand the implication of this. Is there some small example I can refer to?

I will also work on not changing the optimization pipeline for functions not differentiated, but that will require some thoughts and engineering, so I think it would be good to be able to run the autodiff tests isolated from the rest for now. Can you guide me here please?

Can you elaborate on what test conditions you need? What do you mean exactly by "isolated"? Do you mean run the autodiff tests only if there is autodiff support, or do you mean don't run them by default even if there is autodiff support?

ZuseZ4 · 2024-11-25T12:51:17Z

https://github.com/rust-lang/rust/pull/130060/files#diff-a56b374664e290a55d70fa80e456b6280913830b382b73fb70c4483d3d4cf246
adjusts the llvm opt pipeline (if autodiff is enabled at build time and used).
We have a first opt run which skips the late llvm opts (which tend to increase code size), and runs opt a second time (now with the full pipeline) once autodiff is done. I will manage to not make it optimize unrelated code in the future, but for now it means other functions are now optimized 1.5 times by llvm. And in reality llvm opts don't really run to a fixpoint, so that is highly likely to change the IR.
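
The "1.5 times" figure can be pictured with a toy model of the schedule. This is only a sketch of the description above, not of the actual code in the linked diff; the Stage names and the schedule function are made up for illustration.

```rust
/// Toy model of the pass schedule described above; stage names are illustrative.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Stage {
    EarlyOpts,
    LateOpts,
    Autodiff,
}

/// With autodiff in use, the early opts run twice: once before
/// differentiation (late opts skipped) and once in the full second run.
fn schedule(autodiff_used: bool) -> Vec<Stage> {
    if autodiff_used {
        vec![Stage::EarlyOpts, Stage::Autodiff, Stage::EarlyOpts, Stage::LateOpts]
    } else {
        vec![Stage::EarlyOpts, Stage::LateOpts]
    }
}

/// Counts how often a given stage appears in a schedule.
fn count(schedule: &[Stage], stage: Stage) -> usize {
    schedule.iter().filter(|s| **s == stage).count()
}
```

In this model every function in the module sees the early opts twice and the late opts once, which is why unrelated IR can change even though it is never differentiated.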

jieyouxu · 2024-11-27T23:54:13Z

functions are now optimized 1.5 times by llvm. And in reality llvm opts don't really run to a fixpoint, so that is highly likely to change the IR.

This just means that for now, you'll have to gate autodiff-related tests with //@ needs-autodiff or somehow allow compiletest to determine that the llvm used is built with autodiff support and the test is exercising said autodiff support.

Note that this cannot break and should not modify code that does not use autodiff at all, which is indicated by codegen tests that do not use / opt-in to autodiff support.
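
Putting the two suggestions from this thread together, the header of a gated test file might look roughly like the sketch below. The compile-flags line is taken from the comment above. The needs-autodiff directive is the name proposed here, and whether compiletest accepts it under that name is an assumption. The function body is a placeholder for code that would actually exercise autodiff.

```rust
//@ compile-flags: -Clto=fat
//@ needs-autodiff
// ^ directive name as proposed in this thread; whether compiletest accepts
//   it under this name is an assumption.

/// Placeholder body; a real test would exercise `#[autodiff]` here.
pub fn square(x: f32) -> f32 {
    x * x
}
```

Tests without these directives would keep using the unmodified pipeline, which matches the requirement that code not opting in to autodiff must stay unaffected.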

rustbot · 2024-12-13T00:57:29Z

This PR modifies config.example.toml.

If appropriate, please update CONFIG_CHANGE_HISTORY in src/bootstrap/src/utils/change_tracker.rs.

Some changes occurred in coverage instrumentation.

cc @Zalathar

Some changes occurred in cfg and check-cfg configuration

cc @Urgau

These commits modify the Cargo.lock file. Unintentional changes to Cargo.lock can be introduced when switching branches and rebasing PRs.

If this was unintentional then you should revert the changes before this PR is merged.
Otherwise, you can ignore this comment.

compiler/rustc_session/src/config/cfg.rs

ZuseZ4 · 2025-01-02T17:31:46Z

I rebased now that the other autodiff PR got merged, fixed all conflicts, and got it to compile locally.
I will work through the existing feedback over the next days.

bors · 2025-01-25T02:30:42Z

☔ The latest upstream changes (presumably #136030) made this pull request unmergeable. Please resolve the merge conflicts.

ZuseZ4 · 2025-01-25T21:00:05Z

@oli-obk Just to keep track, so far we have 3/4 things which should be fixed here. Potentially not all in this specific PR, but preferably before enabling it for default nightly builds.

1) (Most orthogonal) the macro getting lost when used in dependencies.
2) Performance: Have two opt runs, with AD being applied at the end of the first run. I need to look at it again; last time I wasn't 100% sure how to trigger these two runs after the refactoring.
3) Build setup: Don't force users to pass RUSTFLAGS="-Z llvm-plugins=/home/manuel/prog/rust-working/build/x86_64-unknown-linux-gnu/enzyme/build/Enzyme/LLVMEnzyme-19.so -C passes=enzyme". I'll probably ask on zulip/bootstrap for help with that.
4) (Of course I'll also need to apply the other code quality recommendations above.)

After thinking about it for a bit I'll probably just do 4) for now, and leave 1-3 for a follow-up PR, just for the sake of having a working version upstream. I'll need to talk to jieyouxu to see if we then add tests already here, or in the follow-up PR where I fix 3).

ZuseZ4 · 2025-01-27T01:19:06Z

I'll need to add the fn-ptr error message and left a few questions.
I decided to leave the bootstrapping change for an extra PR, so the tests will probably also need to wait for that (since we use a -Z llvm-plugin flag which requires the absolute path to Enzyme, something which we probably don't want to add as a temporary workaround to the test infra).

ZuseZ4 · 2025-01-30T02:33:12Z

Since it hasn't been added to a rollup yet, I quickly removed an unwanted print statement and, more importantly, fixed a type check in my wrapper that wasn't working correctly because it checked the return type of a function instead of a call. Wrappers are fun. But now even higher-order derivatives from my (not yet upstream) test set pass again!

ZuseZ4 · 2025-01-30T03:34:51Z

@bors r=@oli-obk

bors · 2025-01-30T03:34:54Z

📌 Commit 1f30517 has been approved by oli-obk

It is now in the queue for this repository.

Rollup of 9 pull requests

Successful merges:

- rust-lang#132156 (When encountering unexpected closure return type, point at return type/expression)
- rust-lang#133429 (Autodiff Upstreaming - rustc_codegen_ssa, rustc_middle)
- rust-lang#136281 (`rustc_hir_analysis` cleanups)
- rust-lang#136297 (Fix a typo in profile-guided-optimization.md)
- rust-lang#136300 (atomic: extend compare_and_swap migration docs)
- rust-lang#136310 (normalize `*.long-type.txt` paths for compare-mode tests)
- rust-lang#136312 (Disable `overflow_delimited_expr` in edition 2024)
- rust-lang#136313 (Filter out RPITITs when suggesting unconstrained assoc type on too many generics)
- rust-lang#136323 (Fix a typo in conventions.md)

r? `@ghost`
`@rustbot` modify labels: rollup

lqd · 2025-02-01T07:50:13Z

Hey @ZuseZ4, it looks like this PR had significant perf implications on a small number of benchmarks, as you can see in the post-merge perf results for the rollup where it landed. It could be related to LTO, from a quick look.

I don’t see a pre-merge perf run here, so I assume this is not expected. Could you take a look? Thanks!

Let’s also cc the reviewer @oli-obk.

oli-obk · 2025-02-01T09:39:13Z

Oh yea that was absolutely not expected. looking...

compiler/rustc_codegen_ssa/src/back/write.rs

compiler/rustc_monomorphize/src/partitioning.rs

Add link attribute for Enzyme's LLVMRust FFI Since rust-lang#133429 landed, the compiler doesn't build with `-Zcross-crate-inline-threshold=always`. I don't expect anyone else to test or fix issues with that goofy configuration, so I'm fixing it. This PR adds a link attribute just like rust-lang#118142 for all the new LLVMRust functions. They were actually added in rust-lang#130060 but weren't used until just now.

…ssion, r=<try> test autodiff compile time fixes Tries to fix the regression from rust-lang#133429

oli-obk · 2025-02-02T09:16:43Z

perf regression fixed by #136413

…ssion, r=oli-obk fix autodiff compile time regression Tries to fix the regression from rust-lang#133429 Tracking: - rust-lang#124509

ZuseZ4 · 2025-04-18T23:20:57Z

(Turns out that this introduced the autodiff + debug build failure during bootstrap. ~~Investigating~~).
Fixed in #140030

rustbot assigned jieyouxu Nov 25, 2024

traviscross mentioned this pull request Nov 4, 2024

Tracking Issue for autodiff #124509

Open

7 tasks

jieyouxu requested changes Nov 25, 2024

rustbot added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Nov 25, 2024

ZuseZ4 changed the title ~~upstream rustc_codegen_ssa/rustc_middle changes for enzyme/autodiff~~ Autodiff Upstreaming - rustc_codegen_ssa, rustc_middle Nov 26, 2024

ZuseZ4 mentioned this pull request Nov 27, 2024

Expose experimental LLVM features for GPU offloading rust-lang/rust-project-goals#109

Open

4 tasks

oli-obk self-assigned this Dec 6, 2024

ZuseZ4 marked this pull request as ready for review December 13, 2024 00:57

Urgau reviewed Dec 16, 2024

compiler/rustc_session/src/config/cfg.rs (outdated, resolved)

ZuseZ4 force-pushed the autodiff-middle branch from b30f1d7 to 2ad340e Compare January 2, 2025 17:17

ZuseZ4 force-pushed the autodiff-middle branch from 2ad340e to c4af0ba Compare January 25, 2025 03:13

ZuseZ4 force-pushed the autodiff-middle branch from 1ab7aa0 to 950d91b Compare January 27, 2025 01:15

ZuseZ4 requested a review from jieyouxu January 27, 2025 01:19

rustbot added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Jan 27, 2025

upstream rustc_codegen_ssa/rustc_middle changes for enzyme/autodiff

1f30517

ZuseZ4 force-pushed the autodiff-middle branch from 48d4a6c to 1f30517 Compare January 30, 2025 02:31

jhpratt mentioned this pull request Jan 31, 2025

Rollup of 9 pull requests #136332

Merged

bors merged commit c19c4b9 into rust-lang:master Jan 31, 2025
6 checks passed

rustbot added this to the 1.86.0 milestone Jan 31, 2025

ZuseZ4 deleted the autodiff-middle branch January 31, 2025 15:48

saethlin mentioned this pull request Feb 1, 2025

Add link attribute for Enzyme's LLVMRust FFI #136374

Merged

oli-obk reviewed Feb 1, 2025

compiler/rustc_codegen_ssa/src/back/write.rs (resolved)

oli-obk reviewed Feb 1, 2025

compiler/rustc_monomorphize/src/partitioning.rs (resolved)

ZuseZ4 mentioned this pull request Feb 2, 2025

fix autodiff compile time regression #136413

Merged

bors added a commit to rust-lang-ci/rust that referenced this pull request Feb 2, 2025

Auto merge of rust-lang#136413 - EnzymeAD:fix-autodiff-comptime-regre…

719d8b8

…ssion, r=<try> test autodiff compile time fixes Tries to fix the regression from rust-lang#133429

oli-obk added the perf-regression-triaged The performance regression has been triaged. label Feb 2, 2025

ZuseZ4 added the F-autodiff `#![feature(autodiff)]` label Mar 17, 2025

ZuseZ4 mentioned this pull request Apr 19, 2025

building rustc with autodiff and debug enabled fails. #139704

Closed
