[perf] Disable inline_in_all_cgus when LTO is enabled #65052

ishitatsuyuki · 2019-10-03T08:52:52Z

I tried to run lolbench with this flag altered; however, lolbench either errored or got stuck, so I ended up running only 3 of the benchmarks manually. One showed no change, another somehow showed a 20% perf improvement, and the third showed a slight (1%) regression.

So I'm thinking of benchmarking it with perf instead; if we still have ThinLTO on CI then we can measure stage2 compiler performance as a metric of how much it affects generated code.

rust-highfive · 2019-10-03T08:52:56Z

r? @estebank

(rust_highfive has picked a reviewer for you, use r? to override)

ishitatsuyuki · 2019-10-03T08:53:03Z

cc @Mark-Simulacrum

Mark-Simulacrum · 2019-10-03T11:23:05Z

@bors try @rust-timer queue

A perf run can't hurt, I guess.

rust-timer · 2019-10-03T11:23:06Z

Awaiting bors try build completion

bors · 2019-10-03T11:23:17Z

⌛ Trying commit 5da9200 with merge 8b81ee4...

bors · 2019-10-03T14:32:54Z

☀️ Try build successful - checks-azure
Build commit: 8b81ee4 (8b81ee4b7012d18cf19ef2744bc8457290dd4136)

rust-timer · 2019-10-03T14:32:55Z

Queued 8b81ee4 with parent 0221e26, future comparison URL.

rust-timer · 2019-10-04T02:28:38Z

Finished benchmarking try commit 8b81ee4, comparison URL.

ishitatsuyuki · 2019-10-04T03:26:42Z

Everything is within the margin of noise. Is ThinLTO used for benchmarks?

Mark-Simulacrum · 2019-10-04T12:17:33Z

It should be, yes, for release builds by default I believe.

bjorn3 · 2019-10-04T12:21:48Z

@ishitatsuyuki @rust-timer only benchmarks compilation time, so this means that rustc perf didn't change. lolbench is a different program, which does benchmark execution time.

ishitatsuyuki · 2019-10-04T13:11:18Z

@bjorn3 I am aware of that. What I expected is that this change would indirectly affect the stage2 compiler, which in turn can be benchmarked for a performance difference.

So far what I have heard is that ThinLTO is enabled for both compiler builds and perf, so everything being identical sounds really strange. It seems more likely that the code/flag being changed has no effect at all, so maybe I should ask whoever implemented this originally:

r? @alexcrichton

nikic · 2019-10-04T13:28:35Z

src/librustc_mir/monomorphize/item.rs

@@ -65,7 +65,7 @@ pub trait MonoItemExt<'tcx>: fmt::Debug {
    fn instantiation_mode(&self, tcx: TyCtxt<'tcx>) -> InstantiationMode {
        let inline_in_all_cgus =
            tcx.sess.opts.debugging_opts.inline_in_all_cgus.unwrap_or_else(|| {
-                tcx.sess.opts.optimize != OptLevel::No
+                tcx.sess.opts.optimize != OptLevel::No && if tcx.sess.lto() == Lto::No


There is an if after the &&. Why does this code compile?

ishitatsuyuki · Oct 4, 2019

Oh, good catch, and yeah, I have no idea why this happened to compile. I'll push a fix later.

bjorn3 · Oct 4, 2019

Strange. I can't reproduce on play.rust-lang.org.

ishitatsuyuki · 2019-10-04T13:53:58Z

Oh shit this file is dead code. The code is inside src/librustc/mir/mono.rs instead.
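
For reference, the default that the diff above seems to be aiming for (with the stray `if` dropped) can be sketched outside the compiler. This is only a minimal sketch: the `OptLevel` and `Lto` enums and the `inline_in_all_cgus` helper below are simplified, hypothetical stand-ins, not rustc's real session types or API.

```rust
// Simplified stand-ins for rustc's session options (assumption: the real
// enums have more variants and live elsewhere in the compiler).
#[allow(dead_code)]
#[derive(Clone, Copy, PartialEq, Eq, Debug)]
enum OptLevel {
    No,
    Default,
}

#[allow(dead_code)]
#[derive(Clone, Copy, PartialEq, Eq, Debug)]
enum Lto {
    No,
    Thin,
    Fat,
}

/// Hypothetical helper mirroring the shape of the condition in the diff:
/// an explicit override (the debugging option) wins; otherwise inline into
/// every codegen unit only when optimizing and LTO is disabled.
#[allow(dead_code)]
fn inline_in_all_cgus(explicit: Option<bool>, optimize: OptLevel, lto: Lto) -> bool {
    explicit.unwrap_or_else(|| optimize != OptLevel::No && lto == Lto::No)
}
```

With these stand-ins, `inline_in_all_cgus(None, OptLevel::Default, Lto::Thin)` evaluates to `false`, while an explicit `Some(true)` keeps the old behaviour regardless of the LTO setting.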

ishitatsuyuki · 2019-10-04T14:08:47Z

@Mark-Simulacrum Sorry, can I get another perf run?

Mark-Simulacrum · 2019-10-04T14:13:30Z

@bors try @rust-timer queue

rust-timer · 2019-10-04T14:13:31Z

Awaiting bors try build completion

Mark-Simulacrum · 2019-10-04T14:14:21Z

@bors try

Mark-Simulacrum · 2019-10-04T14:16:04Z

@bors try

bors · 2019-10-04T14:16:08Z

⌛ Trying commit 3e0a02a with merge c5643b3...

bors · 2019-10-04T17:04:59Z

☀️ Try build successful - checks-azure
Build commit: c5643b3 (c5643b3524a2437116951cffe217391943c6fd51)

rust-timer · 2019-10-04T17:05:00Z

Queued c5643b3 with parent 9e35a28, future comparison URL.

rust-timer · 2019-10-04T23:50:30Z

Finished benchmarking try commit c5643b3, comparison URL.

ishitatsuyuki · 2019-10-05T00:47:05Z

Checked this and it seems it's mostly a big loss. Closing.

rust-highfive assigned estebank Oct 3, 2019

rust-highfive added the S-waiting-on-review label (Status: Awaiting review from the assignee but also interested parties.) Oct 3, 2019

rust-highfive assigned alexcrichton and unassigned estebank Oct 4, 2019

nikic reviewed Oct 4, 2019

ishitatsuyuki added 2 commits October 4, 2019 22:55

Remove dead module

c97f5f0

Disable inline_in_all_cgus when LTO is enabled

3e0a02a

ishitatsuyuki force-pushed the no-inline-cgu branch from 5da9200 to 3e0a02a October 4, 2019 13:58

Mark-Simulacrum closed this Oct 4, 2019

Mark-Simulacrum reopened this Oct 4, 2019

ishitatsuyuki closed this Oct 5, 2019

andjo403 mentioned this pull request Oct 10, 2019

Less codegen parallelism than expected with -C codegen-units=16 #64913

Open
