
JIT: consider running some optimizations in Tier0 #9120

Open
Tracked by #76969
AndyAyersMS opened this issue Oct 13, 2017 · 11 comments
Labels
area-CodeGen-coreclr (CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI), JitThroughput (CLR JIT issues regarding speed of the JIT itself)
Milestone

Comments

@AndyAyersMS
Member

The goal of Tier0 is to jit code as quickly as possible. Currently this is done by having Tier0 enable the "minopts" mode in the jit, which disables all optimization. But there are good reasons to believe that running some optimizations in Tier0 can improve the speed of jitting.

To first order, the time it takes to jit a method (especially when optimizing lightly) is proportional to the amount of code the jit produces for it. So any cheap optimization that reduces the overall size of the generated code is a candidate for running in Tier0. Some ideas:

  • enable the importer branch folding
  • enable the early type opts that feed importer branch folding. Perhaps doubly relevant since R2R prejitting will leave generic code to the jit, so Tier0 will see more generic instantiations than "normal", and these are the method bodies that often can be greatly simplified by early type opts. These opts are also pretty cheap.
  • other kinds of simple expression tree simplifications -- maybe some parts of morph are cheap enough to enable?
  • avoid inline expansion of helpers. Tricky because while this helps jit time it slows down the jitted code -- but we hope not to run the Tier0 code very often, so it seems like it could pay off.
  • perhaps inline very small methods. I have some older and perhaps flawed data that indicates this should be a throughput win, but recent results haven't borne this out. It is worth revisiting, though. Certainly, inlining small methods often reduces code size, and it should cut down on the number of jit invocations.
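The first two ideas above boil down to the same mechanism: when a branch condition is already a constant at import time, the dead arm never needs to be imported at all. Below is a purely illustrative toy sketch in Python (the names are hypothetical; the real logic lives in the jit's C++ importer) of why this shrinks Tier0 code:

```python
# Toy sketch of importer-time branch folding (illustration only, not JIT code).
# If a branch condition is a compile-time constant, only the reachable arm
# is imported, so no code at all is generated for the dead arm.

def fold_branch(cond, then_block, else_block):
    """Return the list of blocks that still need to be imported."""
    if cond is True or cond is False:    # condition is already a constant
        return [then_block] if cond else [else_block]
    return [then_block, else_block]      # unknown at import time: keep both arms

# Generic code like `if (typeof(T) == typeof(int))` becomes a constant once
# T is known, so an entire arm is dropped and jit time shrinks with it:
assert fold_branch(True, "int-fast-path", "generic-slow-path") == ["int-fast-path"]
assert fold_branch("runtime-value", "then", "else") == ["then", "else"]
```

Since jit time tracks generated code size, every dead arm skipped here is jit time saved, which is why these opts can pay for themselves even in a tier whose only goal is fast jitting.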

There are notes about this scattered across other issues; I will track these down and link them back here.

category:throughput
theme:optimization
skill-level:expert
cost:medium

@AndyAyersMS
Member Author

Also:

@AndyAyersMS
Member Author

Considering for 2.1.

@AndyAyersMS
Member Author

Did some prototyping of this a while back and wasn't really able to get a clear picture of potential improvements. So am going to hold off on this until after 2.1.

@tannergooding
Member

@AndyAyersMS, is removing dead code for constant branches currently part of this?

There are a few places today where we return gtIconNode (such as Hardware Intrinsics, SIMD Intrinsics, etc) that look like they are clear/easy wins for minopts.

@AndyAyersMS
Member Author

Yeah that's the "importer branch folding" bit mentioned above.

I agree there are wins to be had here. At the time I ran these investigations, I was not seeing consistent wins on the scenarios I was able to measure. But it was early and the measurements were somewhat ad hoc. I believe @noahfalk is working on a more comprehensive set of well-defined scenarios to use in evaluations. It would be nice to have something in there that leverages intrinsics.

So I'll revisit this, but likely not until after 2.1 is more or less done.

@AndyAyersMS
Member Author

See dotnet/coreclr#22984 for some notes on a prototype: EnableBoxingOptsTier0.

@AndyAyersMS
Member Author

We should also consider enabling intrinsic expansion, for both HW intrinsics and for intrinsics that are likely to lead to early control flow trimming.
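The connection between intrinsic expansion and control-flow trimming can be sketched as follows. This is a toy illustration in Python with hypothetical names (`SUPPORTED`, `expand_is_supported`), not JIT code: once an "is supported" intrinsic is expanded to a constant node (like the `gtIconNode` case mentioned above), importer branch folding can delete the guarded dead arm.

```python
# Toy sketch (illustration only): expanding an "IsSupported"-style intrinsic
# to a constant at import time lets branch folding drop the dead arm, as in
# HW-intrinsic guards like `if (Sse2.IsSupported) { ... } else { ... }`.

# Hypothetical capability table standing in for runtime CPU detection.
SUPPORTED = {"Sse2": True, "Avx512F": False}

def expand_is_supported(isa):
    """Replace the intrinsic call with a constant node."""
    return SUPPORTED.get(isa, False)

def import_guarded(isa, vector_path, scalar_path):
    # With the intrinsic folded to a constant, only one arm is imported;
    # the other generates no code at all, saving jit time even at Tier0.
    return vector_path if expand_is_supported(isa) else scalar_path

assert import_guarded("Sse2", "vectorized loop", "scalar loop") == "vectorized loop"
assert import_guarded("Avx512F", "vectorized loop", "scalar loop") == "scalar loop"
```

Without the expansion, the guard is an opaque call and both arms must be imported, so the intrinsic expansion is what makes the trimming possible.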

@AndyAyersMS
Member Author

Note large compiled regexes are currently jitted with minopts.

They would benefit somewhat from intrinsic expansion for String.get_Chars.

If we add optimizations to Tier0, then we might consider compiling these (or any other method too large to optimize) at Tier0 instead of minopts.
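To illustrate why `String.get_Chars` expansion matters for large regex methods: the character accessor is hit constantly in matching loops, and expanding it replaces a call with a bounds check plus a direct load. A purely illustrative Python sketch (all names hypothetical; not JIT code):

```python
# Toy sketch (illustration only): the difference between emitting a call to
# the accessor (what minopts does today) and expanding it inline to a bounds
# check plus an indexed load (what intrinsic expansion would give).

def get_chars_as_call(s, i):
    # minopts today: codegen contains an actual call to the accessor.
    return ("call", "String.get_Chars", s, i)

def get_chars_expanded(s, i):
    # with intrinsic expansion: inline bounds check + direct load,
    # avoiding call overhead on every character access.
    if i < 0 or i >= len(s):
        raise IndexError("index was outside the bounds of the string")
    return s[i]

assert get_chars_expanded("regex", 0) == "r"
assert get_chars_as_call("regex", 0)[0] == "call"
```

A compiled regex method can execute this access millions of times per match run, which is why even a single expanded intrinsic moves the needle for such methods.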

@tannergooding
Member

Is that still going to apply for regexes pre-compiled using source generators?

@AndyAyersMS
Member Author

AndyAyersMS commented Apr 24, 2020

Pre-compiled as in compiled to IL, or pre-compiled as in crossgen?

Regex methods pre-compiled to IL and then jitted will have codegen similar to what you get now, with the exception that since they are no longer dynamic methods they'll be eligible for tiering and all that entails (possibly faster startup/initial jitting, slower matching for the initial iterations, then faster after rejitting -- that is, if they are not too big to optimize).

If you prejit your precompiled regexes, you may get better codegen; I think the jit has higher circuit-breaker limits when prejitting, but it still does have limits. I was planning to document those (see #31942) but haven't done so yet. But given R2R overhead and the fact that calls from the regex code to the regex library won't be inlined, it is hard to say for sure where you might end up.

Prejitting large methods may lead to perf anomalies where the R2R code is optimized, then tiering decides to replace it, and then the jit decides not to optimize since it has tighter constraints on jit time than prejit time. This is something we need to watch closely; presumably we'd be better off just keeping the prejitted code in such cases.

@tannergooding
Member

One of the proposals for source generators is to use them to convert a regex into C# code, which would then be naturally compiled down to IL.

But it sounds like it would still be applicable there as well :)
