uint: Implement modulo operations for special moduli #108

haslersn · 2022-08-07T11:57:33Z

For a project of mine where the modulus can be chosen to be close to UInt::MAX, I created optimized implementations of modular operations. I thought maybe others can benefit from them, too, so I ported my implementation to the crypto-bigint crate.

This commit implements modulo operations (neg, add, sub, mul) for special moduli that are so close to MAX that the difference to overflow fits in a single Limb. For such moduli, these new implementations are much faster than the existing generic modulus implementations. (For mul there's no comparison since there's no corresponding generic modulus implementation, yet.)

For U256, I benchmarked the generic against the specialized implementations using criterion-rs on Intel Core i7-8565U @ 1.80GHz and obtained the following average times. Note that I used a const modulus known at compile-time, which enables some compiler optimizations after inlining. With a modulus known only at runtime, times might differ.

	`U256::add_mod`	`U256::sub_mod`	`U256::mul_mod`
generic (after #109 got merged)	10.857 ns	9.6262 ns	not implemented
`_special`	3.8276 ns	4.1339 ns	20.188 ns

Implement modulo operations (`neg`, `add`, `sub`, `mul`) for special moduli that are so close to `MAX` that the difference to overflow fits in a single `Limb`. For such moduli, these new implementations are much faster than the existing generic modulus implementations. (For `mul` there's no comparison since there's no corresponding generic modulus implementation, yet.)

tarcieri · 2022-08-08T11:56:23Z

This looks reasonable enough at first glance. Give me a little more time to more thoroughly review it.

haslersn · 2022-08-08T14:49:01Z

I had an error in my benchmarks, namely I forgot to pass the inputs through black_box. I corrected the original post with updated benchmark results.

tarcieri · 2022-08-15T00:19:20Z

Thank you!

haslersn force-pushed the special-moduli-ops branch 3 times, most recently from ce2ea4a to 0dee5d2 Compare August 7, 2022 12:11

haslersn force-pushed the special-moduli-ops branch from 0dee5d2 to 4206e88 Compare August 7, 2022 12:27

PopcornPaws mentioned this pull request Aug 8, 2022

Optimize modular multiplication guildxyz/guild-zk#32

Open

haslersn mentioned this pull request Aug 8, 2022

limb: always inline bitand #109

Merged

tarcieri merged commit f28cef1 into RustCrypto:master Aug 15, 2022

haslersn deleted the special-moduli-ops branch August 15, 2022 14:36

tarcieri mentioned this pull request Oct 11, 2022

v0.4.9 #131

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

uint: Implement modulo operations for special moduli #108

uint: Implement modulo operations for special moduli #108

haslersn commented Aug 7, 2022 •

edited

Loading

tarcieri commented Aug 8, 2022

haslersn commented Aug 8, 2022

tarcieri commented Aug 15, 2022

uint: Implement modulo operations for special moduli #108

uint: Implement modulo operations for special moduli #108

Conversation

haslersn commented Aug 7, 2022 • edited Loading

tarcieri commented Aug 8, 2022

haslersn commented Aug 8, 2022

tarcieri commented Aug 15, 2022

haslersn commented Aug 7, 2022 •

edited

Loading