[wasm] Re-enable null check optimization for mid-method traces #84058
Conversation
Tagging subscribers to 'arch-wasm': @lewing

Issue details: Right now, traces in the middle of methods don't support null check optimizations, since we don't know whether an ldloca happened earlier in the method. This PR adds a quick scan of the method leading up to the trace for ldlocas, so we have the information we need for (partial) null check optimization.
cc @BrzVlad if you have time to look: does this make sense to you?
Interp promotes address-taken vars to be global in transform.c. This is also fine if you prefer to do it later.
That is perfect, thanks for pointing out where that happens. I've previously experimented with flowing variable data through, with mixed success, but I think next time I'll be able to get it in good shape. It's already on the todo list in the tracking issue; I just haven't had time to redo this stuff from the ground up and flow the information through. A "this method has ldloca" flag sounds like a really simple starting point that would be valuable.
Force-pushed from d566fb6 to 808ac5b with these commits:

- …spaces in the stack have had their address taken
- Use the interpreter bitset for jiterpreter null check optimization
- Enable null check optimization for all traces
Reworked, inspired by @BrzVlad's feedback: we now build a bitset when inserting trace(s) into a method, where each bit indicates whether the 8 bytes in that slot ever have their address taken during the method. In my testing this matches the old analysis 99%+ of the time, so it shouldn't cause any performance regressions, but it's also more correct, since it handles the case Vlad pointed out (ldlocas after the current instruction that we haven't seen yet, in methods with loops). It also handles the hypothetical case where we null-check-optimize a field of a stack-allocated struct and then someone ldlocas the whole struct, because we can now ensure all the slots occupied by the struct have the address-taken flag set. (The jiterpreter doesn't currently know the type or size of locals.) The memory cost of the bitset feels potentially bad, but I'm not sure how I could do much better here. Maybe a packed list of offsets and sizes instead?
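The bitset approach described above can be sketched roughly as follows. This is an illustrative model, not the actual mono/jiterpreter code: names like `mark_address_taken` and `SLOT_SIZE` are hypothetical, and it assumes one bit per 8-byte interpreter stack slot, with a multi-slot local (e.g. a struct) setting every slot it overlaps.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical sketch: one bit per 8-byte interp stack slot; a set bit
 * means some local occupying that slot has its address taken (ldloca). */
#define SLOT_SIZE 8

static void
mark_address_taken (uint8_t *bitset, uint32_t offset, uint32_t size)
{
    /* Mark every slot the local overlaps, so taking the address of a
     * whole struct also flags the slots holding its fields. */
    uint32_t first = offset / SLOT_SIZE;
    uint32_t last = (offset + size - 1) / SLOT_SIZE;
    for (uint32_t slot = first; slot <= last; slot++)
        bitset[slot / 8] |= (uint8_t)(1u << (slot % 8));
}

static int
is_address_taken (const uint8_t *bitset, uint32_t offset)
{
    uint32_t slot = offset / SLOT_SIZE;
    return (bitset[slot / 8] >> (slot % 8)) & 1;
}
```

Because the flag is per-slot rather than per-local, the consumer doesn't need to know local types or sizes, which matches the constraint noted above.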
```diff
@@ -942,6 +942,30 @@ trace_info_alloc () {
 	return index;
 }

+static void
+build_address_taken_bitset (TransformData *td, InterpBasicBlock *bb, guint32 bitset_size)
```
If you want, you can further optimize this a bit. For example, you can add a new field to TransformData, something like max_indirect_offset, that you set where address-taken vars are flagged (runtime/src/mono/mono/mini/interp/transform.c, line 10316 in d0913fc):

```c
td->locals [var].flags |= INTERP_LOCAL_FLAG_GLOBAL;
```

instead of sizing by td->total_locals_size. You can then run this pass only if this field is non-zero, and you can also use less memory for the bitset array (note you would need to do a range check in jiterp for overflowing offsets).
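The reviewer's suggestion could look roughly like this. Everything here is a hypothetical sketch, not the actual TransformData layout: the idea is to record a high-water mark of address-taken offsets, size the bitset by that watermark instead of total_locals_size, and skip the pass entirely when no address is ever taken.

```c
#include <assert.h>
#include <stdint.h>

#define SLOT_SIZE 8

/* Illustrative stand-in for the real TransformData. */
typedef struct {
    uint32_t max_indirect_offset; /* hypothetical new field */
    uint32_t total_locals_size;
} TransformDataSketch;

/* Called at the site where a var's address is taken (where
 * INTERP_LOCAL_FLAG_GLOBAL gets set in transform.c). */
static void
note_address_taken (TransformDataSketch *td, uint32_t offset, uint32_t size)
{
    uint32_t end = offset + size;
    if (end > td->max_indirect_offset)
        td->max_indirect_offset = end;
}

/* Bitset only needs to cover offsets below the watermark; offsets at or
 * past it would need the range check in jiterp mentioned above. */
static uint32_t
bitset_bytes_needed (const TransformDataSketch *td)
{
    if (td->max_indirect_offset == 0)
        return 0; /* no ldloca anywhere in the method: skip the pass */
    uint32_t slots = (td->max_indirect_offset + SLOT_SIZE - 1) / SLOT_SIZE;
    return (slots + 7) / 8;
}
```

For a method with a large frame but only one small address-taken local near the start, this shrinks the bitset from covering the whole frame down to a byte or two.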
That sounds great, but I'd prefer to do it in a follow-up PR. Is that OK?
sure, also if you consider this to be worth it
This PR moves the ldloca analysis from the jiterpreter trace generator to the pass that inserts entry points, producing a bitset with a 1 for each interpreter stack slot that has its address taken. As a result we're able to remove the 'trace must start at the beginning of the method' restriction for null check optimization, and we pick up some small correctness improvements along with it.
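The consumer side of the summary above can be sketched as a single guard. This is an assumed shape, not the jiterpreter's actual API: before the trace compiler reuses a cached "this local is non-null" fact, it checks the address-taken bitset, because a slot whose address has escaped via ldloca could be overwritten through a pointer between the check and the reuse.

```c
#include <assert.h>
#include <stdint.h>

#define SLOT_SIZE 8

/* Hypothetical guard: null check elimination is only safe for a local
 * whose stack slot never has its address taken during the method. */
static int
can_skip_null_check (const uint8_t *bitset, uint32_t local_offset)
{
    uint32_t slot = local_offset / SLOT_SIZE;
    int address_taken = (bitset[slot / 8] >> (slot % 8)) & 1;
    return !address_taken;
}
```

Since the bitset is built once per method when entry points are inserted, mid-method traces get the same answer as traces starting at the method entry, which is what lifts the old restriction.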