
precompute offsets ahead-of-time rather than on each dereference #141

Merged 2 commits into near:near-main on Sep 23, 2022

Conversation

Ekleog-NEAR (Contributor)

Criterion results on a test related to near/nearcore-private-1#2:

compile                 time:   [9.7873 ms 9.8477 ms 9.9119 ms]
                        change: [-5.2388% -4.2981% -3.2369%] (p = 0.00 < 0.05)
                        Performance has improved.

Manual testing on the exact test of near/nearcore-private-1#2 confirms the ~5% speedup:

Non-rayon goes down to 24.5-25ms from 25.5-26ms
Rayon goes down to 12ms from 12.5ms

To me this also feels more like a code cleanup than complexity added for optimization's sake, but I would understand if other people felt the added invariants amount to added complexity.
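For context, a minimal sketch of the before/after shape of the change (names and the constant 4 are illustrative, not the real VMOffsets fields): previously an offset was recomputed with checked arithmetic on every access; after this PR all offsets are computed once and then simply read back.

// Illustrative sketch only; does not match the actual VMOffsets type.
struct Offsets {
    num_signature_ids: u32,
    // Filled in by `precompute`, then read on every access.
    vmctx_signature_ids_begin: u32,
    vmctx_imported_functions_begin: u32,
}

impl Offsets {
    // Before: every call recomputes the offset with checked arithmetic.
    fn imported_functions_begin_computed(&self) -> u32 {
        // 4 stands in for the size of a signature index; signature ids start at offset 0.
        self.num_signature_ids
            .checked_mul(4)
            .expect("offset overflow")
    }

    // After: compute every offset once, ahead of time, then dereference stored fields.
    fn precompute(&mut self) {
        self.vmctx_signature_ids_begin = 0;
        self.vmctx_imported_functions_begin = self
            .num_signature_ids
            .checked_mul(4)
            .and_then(|x| x.checked_add(self.vmctx_signature_ids_begin))
            .expect("offset overflow");
    }
}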

matklad (Contributor) commented Sep 22, 2022

Rayon goes down to 12ms from 12.5ms

I think we agreed to just kill rayon (near/nearcore#8948); could you send a quick PR to do that as a follow-up?


fn precompute(&mut self) {
    self.vmctx_signature_ids_begin = 0;
    self.vmctx_imported_functions_begin = self
nagisa (Collaborator) commented Sep 22, 2022

This is so much more comprehensible than the previous code already, but I do wonder if there’s an opportunity to further simplify this code with some helpers.

For instance: instead of manually chaining checked_add and checked_mul, that logic could live in a utility function along the lines of fma (fused multiply-add).

That way we’d be looking at something like

self.vmctx_imported_functions_begin = offset_fma(
    self.num_signature_ids,
    u32::from(self.size_of_vmshared_signature_index()),
    self.vmctx_signature_ids_begin,
);

which seems like less noise overall. (names/argument ordering at your choice)
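A minimal sketch of what such a helper could look like; offset_fma is not an existing API, and the name, argument order, and overflow handling here are one possible choice, not necessarily what the PR ended up with:

/// Hypothetical helper: computes `a * b + c` with checked arithmetic,
/// panicking on overflow. Purely illustrative of the suggestion above.
fn offset_fma(a: u32, b: u32, c: u32) -> u32 {
    a.checked_mul(b)
        .and_then(|ab| ab.checked_add(c))
        .expect("vmctx offset overflow")
}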

Ekleog-NEAR (Contributor, Author)

Great idea, thanks! I've just pushed a commit doing this change :)

Ekleog-NEAR merged commit d30991d into near:near-main on Sep 23, 2022
bors bot added a commit to wasmerio/wasmer that referenced this pull request Nov 21, 2022
3292: Precompute offsets in VMOffsets r=ptitSeb a=ptitSeb

# Description
Small optimisation: Precompute Offsets in VMOffsets

based on near/wasmer#141

For #3305

Co-authored-by: ptitSeb <sebastien.chev@gmail.com>