ecmult_multi: reduce strauss memory usage by 30% #1761

jonasnick · 2025-10-17T14:28:28Z

This is a draft because I'm not sure about the cleanest way to implement it.

real-or-random · 2025-10-27T08:02:41Z

This is a draft because I'm not sure about the cleanest way to implement it.

The current approach looks clean. What other approaches do you have in mind?

src/ecmult_impl.h

hebasto

Concept ACK.

hebasto

On my x86_64 system, this PR reduces the memory allocated on the scratch space from 2224 bytes to 1452 bytes per point.

Please ping me once it’s undrafted.

jonasnick · 2025-10-27T18:52:25Z

The current approach requires a temporary array int wnaf_tmp[256]; to provide to secp256k1_ecmult_wnaf which looks unclean. The alternatives are

copy almost all of secp256k1_ecmult_wnaf into secp256k1_ecmult_wnaf_small, or
remove secp256k1_ecmult_wnaf_small and write an secp256k1_ecmult macro.

Both options seem to be worse.

hebasto · 2025-10-27T22:00:59Z

The current approach requires a temporary array int wnaf_tmp[256]; to provide to secp256k1_ecmult_wnaf which looks unclean. The alternatives are

copy almost all of secp256k1_ecmult_wnaf into secp256k1_ecmult_wnaf_small, or

remove secp256k1_ecmult_wnaf_small and write an secp256k1_ecmult macro.

Both options seem to be worse.

I might suggest a third option: hebasto@5c0d6ee.

real-or-random · 2025-11-07T09:37:13Z

src/ecmult_impl.h

+    int wnaf_tmp[256];
+    int ret, i;
+
+    VERIFY_CHECK(2 <= w && w <= 8);


Suggested change

VERIFY_CHECK(2 <= w && w <= 8);

VERIFY_CHECK(2 <= w && w <= 7);

I don't see why w = 8 wouldn't work. The documentation of wnaf says

* - each wnaf[i] is either 0, or an odd integer between -(1<<(w-1) - 1) and (1<<(w-1) - 1)

So for w = 8, wnaf[i] is in [-127, 127] which fits in an int8_t.

Sorry, yes, you're right. I was getting confused. secp256k1_ecmult_wnaf itself needs w <= 31 (and not 32), if only because it performs a carry << w shift (for int carry) which is certainly UB if int is 32 bits. (In fact, if carry == 1, then even 1 << 31 is UB. This is another edge case that we should fix! Let me add this to the other issue.)

But since your function only copies the results, everything is fine.

In fact, if carry == 1, then even 1 << 31 is UB. This is another edge case that we should fix! Let me add this to the other issue

Oh, great catch!

real-or-random

I think the current approach in the PR is good. It may not be elegant to have a tmp array, but it's simple and correct. We'd need to benchmark if the tmp array makes a difference in the end. But I think this PR needs a benchmark in general to make sure that using int8_t does not increase the running time (much).

If we want to avoid it, here's yet another variant: real-or-random@f83731b
It uses a macro to define different variants of secp256k1_ecmult_wnaf parametrized in the output type. The macro is not elegant either, but this variant is better for type safety than just turning secp256k1_ecmult_wnaf into a macro.

In fact, the current secp256k1_ecmult_wnaf needs the unstated and unchecked assumption that int has at least 32 value bits when it VERIFY_CHECKs that w <= 31. In practice, we call it only with WINDOW_A == 5 and WINDOW_G == ECMULT_WINDOW_SIZE where the latter is configurable in the range 2..24.

A consequence of this "bug" is that the code fails on a 16-bit platform if you set ECMULT_WINDOW_SIZE > 16. I don't think we need to support this, but code without unchecked assumptions is bad. So I suggest that we rewrite the function to use int32_t instead of int even if we don't use my macro approach. Alternatively, we could add the assumption that INT_MAX >= INT32_MAX but this forbids 16-bit platforms, and the code seems to work on them in principle; see #792 (comment).

real-or-random · 2025-11-07T10:17:40Z

I might suggest a third option: hebasto@5c0d6ee.

Sorry, I forgot to comment on that option. That's also clean, but it introduces a lot of code complexity.

The way I see it:

If the current approach is fine performance-wise, let's take it.
If not, use either @hebasto's approach or mine. If you ask me, I prefer mine slightly because it's less code and more "direct" even though it uses a macro.
If none of this is satisfactory, we can still duplicate the code.

jonasnick · 2025-11-12T19:26:22Z

Thanks for demonstrating creative alternative solutions :) If it weren't for the layers of indirection, I'd consider @hebasto's approach to be the most elegant. The PR's current approach is just so much simpler. And I ran benchmarks with bench_ecmult, which showed at most a 0.1us slowdown (for some number of points) on my Intel i7 machine.

real-or-random

utACK 26166c4

ecmult_multi: reduce strauss memory usage by 30%

26166c4

real-or-random added the performance label Oct 27, 2025

real-or-random added the tweak/refactor label Oct 27, 2025

real-or-random reviewed Oct 27, 2025

View reviewed changes

src/ecmult_impl.h Show resolved Hide resolved

hebasto reviewed Oct 27, 2025

View reviewed changes

real-or-random reviewed Nov 7, 2025

View reviewed changes

real-or-random mentioned this pull request Nov 7, 2025

_ecmult_wnaf relies on int having at least 32 value bits #1769

Open

This was referenced Nov 7, 2025

[WIP] Fix assumption on int size in ecmult_wnaf function #1770

Closed

Use int32_t for wnaf values to remove assumption that int has 32 value bits #1772

Closed

jonasnick marked this pull request as ready for review November 12, 2025 19:26

real-or-random approved these changes Nov 12, 2025

View reviewed changes

	VERIFY_CHECK(2 <= w && w <= 8);
	VERIFY_CHECK(2 <= w && w <= 7);

ecmult_multi: reduce strauss memory usage by 30% #1761

Are you sure you want to change the base?

ecmult_multi: reduce strauss memory usage by 30% #1761

Uh oh!

Conversation

jonasnick commented Oct 17, 2025

Uh oh!

real-or-random commented Oct 27, 2025

Uh oh!

Uh oh!

hebasto left a comment

Choose a reason for hiding this comment

Uh oh!

hebasto left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jonasnick commented Oct 27, 2025

Uh oh!

hebasto commented Oct 27, 2025

Uh oh!

real-or-random Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

jonasnick Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

real-or-random Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

jonasnick Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

real-or-random left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

real-or-random commented Nov 7, 2025

Uh oh!

jonasnick commented Nov 12, 2025

Uh oh!

real-or-random left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

hebasto left a comment •

edited

Loading

real-or-random left a comment •

edited

Loading