[WIP] BoxedResidue: fix constant-time TODO in `montgomery_mul` #415

tarcieri · 2023-12-10T20:29:53Z

Replaces branching with an arithmetic-based approach. This unfortunately seems to double the time multiplication takes (and with it, modpow).

Benchmarks

Montgomery arithmetic/multiplication, BoxedUint*BoxedUint
                        time:   [10.027 µs 10.048 µs 10.068 µs]
                        change: [+101.70% +102.44% +103.19%] (p = 0.00 < 0.05)
                        Performance has regressed.

Montgomery arithmetic/modpow, BoxedUint^BoxedUint
                        time:   [48.200 ms 48.273 ms 48.352 ms]
                        change: [+97.268% +97.613% +97.972%] (p = 0.00 < 0.05)
                        Performance has regressed.

Replaces branching with an arithmetic-based approach. This unfortunately seems to double the time multiplication takes (and with it, modpow). Montgomery arithmetic/multiplication, BoxedUint*BoxedUint time: [10.027 µs 10.048 µs 10.068 µs] change: [+101.70% +102.44% +103.19%] (p = 0.00 < 0.05) Performance has regressed. Montgomery arithmetic/modpow, BoxedUint^BoxedUint time: [48.200 ms 48.273 ms 48.352 ms] change: [+97.268% +97.613% +97.972%] (p = 0.00 < 0.05) Performance has regressed.

tarcieri · 2023-12-10T20:30:50Z

src/modular/boxed_residue/mul.rs

+/// Compare limbs in constant time, returning `Limb::ONE` if the left size is less than the right.
+#[inline(always)]
+fn limb_ct_lt(a1: Limb, b1: Limb, a2: Limb, b2: Limb) -> Limb {
+    (a1.sbb(b1, Limb::ZERO).1 | a2.sbb(b2, Limb::ZERO).1) & Limb::ONE
+}


This was faster than using the ct_lt approach previously used on L214, but still slower than using branch instructions

fjarri · 2023-12-14T06:59:06Z

This could benefit from #418

fjarri · 2023-12-16T05:57:43Z

Doesn't seem like it benefits.

But judging by Godbolt, using overflowing_add() there doesn't lead to branches. Directly casting the booleans to Word produces the same code as the usage of overflowing_add().

tarcieri · 2023-12-17T23:10:37Z

Closing this for now

tarcieri commented Dec 10, 2023

View reviewed changes

tarcieri mentioned this pull request Dec 12, 2023

Limb: optimize constant-time comparisons #413

Merged

tarcieri mentioned this pull request Dec 16, 2023

Use wide arithmetic in CtChoice #418

Closed

tarcieri closed this Dec 17, 2023

tarcieri deleted the boxed-residue/constant-time-mul branch December 17, 2023 23:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP] BoxedResidue: fix constant-time TODO in `montgomery_mul` #415

[WIP] BoxedResidue: fix constant-time TODO in `montgomery_mul` #415

Uh oh!

tarcieri commented Dec 10, 2023

Uh oh!

tarcieri Dec 10, 2023

Uh oh!

fjarri commented Dec 14, 2023

Uh oh!

fjarri commented Dec 16, 2023

Uh oh!

tarcieri commented Dec 17, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[WIP] BoxedResidue: fix constant-time TODO in montgomery_mul #415

[WIP] BoxedResidue: fix constant-time TODO in montgomery_mul #415

Uh oh!

Conversation

tarcieri commented Dec 10, 2023

Benchmarks

Uh oh!

tarcieri Dec 10, 2023

Choose a reason for hiding this comment

Uh oh!

fjarri commented Dec 14, 2023

Uh oh!

fjarri commented Dec 16, 2023

Uh oh!

tarcieri commented Dec 17, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[WIP] BoxedResidue: fix constant-time TODO in `montgomery_mul` #415

[WIP] BoxedResidue: fix constant-time TODO in `montgomery_mul` #415