Montgomery multiplication improvements #203

fjarri · 2023-03-29T07:34:30Z

Added benchmarks for Montgomery multiplication and DynResidueParams creation
Replaced the final reduction in montgomery_reduction() with sub_mod_with_hi(). This speeds it up by ~20% (for U256), possibly due to better vectorization.
Extracted muladdcarry() from montgomery_reduction()
Simplified Uint::sub_mod(), sub_mod_special(), add_mod(), add_mod_special() by reusing existing methods.
Made DynResidueParams::new() a const fn
Optimized the mod_neg_inv calculation in DynResidueParams::new(), which speeds it up by ~10% (for U256).
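
For readers unfamiliar with the two pieces touched above (the final reduction step and the mod_neg_inv constant), here is a minimal single-word sketch. It is not the crate's code: it assumes one `u64` limb with R = 2^64, and the names `mod_neg_inv` and `montgomery_reduce` as written here are illustrative stand-ins. The Newton iteration is one common way to get -m^-1 mod 2^64 and is not necessarily what `DynResidueParams::new()` does.

```rust
/// Computes -m^-1 mod 2^64 for an odd `m` by Newton iteration.
/// (Illustrative; assumed approach, not taken from the PR.)
const fn mod_neg_inv(m: u64) -> u64 {
    // For odd m, m * m = 1 (mod 8), so `m` is its own inverse to 3 bits.
    let mut inv = m;
    let mut i = 0;
    while i < 5 {
        // Each step doubles the number of correct low bits: 3, 6, 12, 24, 48, 96.
        inv = inv.wrapping_mul(2u64.wrapping_sub(m.wrapping_mul(inv)));
        i += 1;
    }
    inv.wrapping_neg()
}

/// Montgomery reduction of `t < m * 2^64`: returns t * 2^-64 mod m.
fn montgomery_reduce(t: u128, m: u64, m_neg_inv: u64) -> u64 {
    // Choose q so that t + q * m is divisible by 2^64.
    let q = (t as u64).wrapping_mul(m_neg_inv);
    // The sum can exceed 128 bits by one bit; keep that bit as `hi`.
    let (sum, carry) = t.overflowing_add((q as u128) * (m as u128));
    let lo = (sum >> 64) as u64;
    let hi = carry as u64;
    // Final reduction: (hi, lo) < 2m, so subtract m at most once.
    // The subtraction is kept if the extra `hi` bit is set or if lo >= m.
    let (diff, borrow) = lo.overflowing_sub(m);
    let keep_diff = hi | (!borrow as u64);
    let mask = keep_diff.wrapping_neg(); // all ones or all zeros
    (diff & mask) | (lo & !mask)
}
```

The last four lines are the part that corresponds to a "subtract the modulus, taking an extra high word into account" step; selecting by mask instead of branching is a design choice of this sketch.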

Originally I wanted to implement Montgomery multiplication as simultaneous multiplication + reduction instead of mul_wide followed by reduction, but it showed exactly the same performance, with the exception of the final reduction step. That's how I discovered the performance improvement. I'm still not quite sure what causes it.
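
To make "simultaneous multiplication + reduction" concrete, here is a rough sketch of an interleaved (CIOS-style) Montgomery multiplication over little-endian `u64` limbs. It is not the code from this PR; `mac` and `mont_mul_interleaved` are hypothetical names, the sketch assumes N >= 1, an odd modulus, inputs already reduced below `m`, and it uses a plain branch at the end rather than a constant-time selection.

```rust
/// Multiply-accumulate: returns (low, high) words of `acc + x * y + carry`.
/// The sum always fits in 128 bits.
fn mac(acc: u64, x: u64, y: u64, carry: u64) -> (u64, u64) {
    let wide = acc as u128 + (x as u128) * (y as u128) + carry as u128;
    (wide as u64, (wide >> 64) as u64)
}

/// Returns a * b * 2^(-64 * N) mod m, with `m_neg_inv` = -m^-1 mod 2^64.
fn mont_mul_interleaved<const N: usize>(
    a: &[u64; N],
    b: &[u64; N],
    m: &[u64; N],
    m_neg_inv: u64,
) -> [u64; N] {
    // Accumulator with two extra limbs for carries.
    let mut t = vec![0u64; N + 2];
    for i in 0..N {
        // Multiplication step: t += a * b[i].
        let mut carry = 0u64;
        for j in 0..N {
            let (lo, hi) = mac(t[j], a[j], b[i], carry);
            t[j] = lo;
            carry = hi;
        }
        let (sum, overflow) = t[N].overflowing_add(carry);
        t[N] = sum;
        t[N + 1] = overflow as u64;

        // Reduction step: add q * m so the lowest limb becomes zero,
        // then shift the accumulator down by one limb.
        let q = t[0].wrapping_mul(m_neg_inv);
        let (_, mut carry) = mac(t[0], q, m[0], 0);
        for j in 1..N {
            let (lo, hi) = mac(t[j], q, m[j], carry);
            t[j - 1] = lo;
            carry = hi;
        }
        let (sum, overflow) = t[N].overflowing_add(carry);
        t[N - 1] = sum;
        t[N] = t[N + 1] + overflow as u64;
    }

    // Final reduction: the value (t[N], t[..N]) is below 2m, so subtract
    // m once if the high limb is set or if t[..N] >= m.
    let mut diff = [0u64; N];
    let mut borrow = false;
    for j in 0..N {
        let (d, b1) = t[j].overflowing_sub(m[j]);
        let (d, b2) = d.overflowing_sub(borrow as u64);
        diff[j] = d;
        borrow = b1 || b2;
    }
    let mut out = [0u64; N];
    if t[N] != 0 || !borrow {
        out = diff;
    } else {
        out.copy_from_slice(&t[..N]);
    }
    out
}
```

Compared with a full wide multiplication followed by a separate reduction pass, this form needs only N + 2 limbs of scratch space; both variants end with the same single conditional subtraction.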

(By the way, I measured the performance on arm64 - would be interesting to see the results for x64)

tarcieri · 2023-03-29T16:06:28Z

Seems like a nice simplification. Thanks!

fjarri added 5 commits March 28, 2023 23:00

Add Montgomery multiplication benchmark

c7574a8

Speed up Montgomery reduction

eb46875

Make DynResidueParams::new() a const function

24cc80a

Add DynResidueParams creation benchmark

5f8dba9

Speed up DynResidueParams creation

04dc017

fjarri force-pushed the montgomery-mul branch from 3160dc2 to 04dc017 March 29, 2023 07:38

tarcieri merged commit 74b481b into RustCrypto:master Mar 29, 2023

fjarri deleted the montgomery-mul branch March 29, 2023 23:53

tarcieri mentioned this pull request Apr 26, 2023

v0.5.2 #217

Merged
