
faster algorithm #15

Open · warner wants to merge 3 commits into master from 12-count-scalars
Conversation

warner (Owner) commented Mar 4, 2020

5x speedup: instead of generating a new keypair for each trial, we count scalars and add points. This will also help with distributing the search among untrusted worker machines.

closes #12
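The "count scalars, add points" idea can be sketched with a toy stand-in group. This is a hypothetical illustration, not the crate's actual API: the multiplicative group mod a prime plays the role of Curve25519, so "scalar multiplication" is `pow()` and "point addition" is modular multiplication.

```python
# Toy sketch of the search restructuring in this PR. The real code works
# on Curve25519; here (Z_P*, *) stands in for the curve group. All names
# are illustrative.

P = 2**61 - 1          # a Mersenne prime; its multiplicative group plays the curve's role
G = 7                  # generator, standing in for the curve base point

def slow_search(start_scalar, trials):
    # old approach: one full "scalar multiplication" per trial
    return [pow(G, start_scalar + 8 * i, P) for i in range(trials)]

def fast_search(start_scalar, trials):
    # new approach: one scalar multiplication up front, then a cheap
    # group operation ("point addition") per candidate
    step = pow(G, 8, P)            # fixed offset point, computed once
    point = pow(G, start_scalar, P)
    out = []
    for _ in range(trials):
        out.append(point)
        point = point * step % P   # "add" the offset point
    return out

assert slow_search(12345, 100) == fast_search(12345, 100)
```

The offset of 8 mirrors the PR's choice: stepping the private scalar by a multiple of 8 keeps it compatible with X25519 clamping, as discussed later in the thread.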

warner force-pushed the 12-count-scalars branch from da601dc to b83e8ee on Mar 4, 2020 05:55
warner (Owner, Author) commented Mar 4, 2020

@hdevalence hey, if you get some time, could you take a look at this? I think I applied everything we talked about, but afterwards I discovered the difficulty of switching back and forth between the two scalar representations. The approach I came up with seems sound, but I'd appreciate another pair of eyeballs on it, along with any cleanups or better ways to approach this you can think of. The big comment in lib.rs should explain my reasoning.

I'm bummed that a significant part of the search time (3.2us out of 3.4us) is spent in converting the Edwards point (where we can use addition) to the Montgomery form (which is what gets base64-converted). I was hoping that point addition would be the dominant factor, but it's only 0.2us. I can't think of any way around that, however.
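For context, the Edwards-to-Montgomery step being discussed is the birational map from RFC 7748, u = (1 + y) / (1 - y) mod p; the field inversion it requires is what makes the conversion so much more expensive than a point addition. A minimal sketch of just the map (not the crate's code):

```python
# Edwards y-coordinate -> Montgomery u-coordinate, per RFC 7748.
# The pow(..., -1, p) modular inversion is the costly operation.

p = 2**255 - 19                       # Curve25519 field prime

def edwards_y_to_montgomery_u(y):
    return (1 + y) * pow(1 - y, -1, p) % p

# Sanity check: the Ed25519 base point has y = 4/5 mod p, and its
# Montgomery u-coordinate is 9 (the X25519 base point).
y_base = 4 * pow(5, -1, p) % p
assert edwards_y_to_montgomery_u(y_base) == 9
```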

@tarcieri, we discussed this approach a long time ago too, I'd love to hear your thoughts.

warner self-assigned this Mar 4, 2020
warner added 3 commits March 27, 2020 18:18
The basic operation used to take 17us/iter on my 2019 mac mini (3.2GHz Core
i7). The new approach takes 3.8us/iter.

refs #12
warner force-pushed the 12-count-scalars branch from b83e8ee to 803e4c5 on Mar 28, 2020 01:22
eliliam commented Dec 1, 2021

Could we get this merged in?

megapro17 commented:

wireguard-vanity-address.zip

eliliam commented Dec 13, 2022

> wireguard-vanity-address.zip

I can't speak for anyone else, but there's no way I'm going to download this mysterious zip file with no explanation or anything, from someone who isn't already a part of the existing issue.

If you want to help, please explain what you have as a solution, and link to source code we can review instead of a black box zip file.

mchangrh commented:

I made a statically linked build running in a docker container that targets this branch - source code

Hopefully this is less sketchy ;)

megapro17 commented:

I like how you sent it to VirusTotal, and what did you get? Nothing?

AlexanderYastrebov commented:

Hello, I've created a similar tool, https://github.com/AlexanderYastrebov/wireguard-vanity-key, based on your ideas from here 👍

To squeeze out the last drops of performance, I eliminated allocations and adjust the scalar only once, outside of the main search loop.

> I'm bummed that a significant part of the search time (3.2us out of 3.4us) is spent in converting the Edwards point (where we can use addition) to the Montgomery form (which is what gets base64-converted). I was hoping that point addition would be the dominant factor, but it's only 0.2us. I can't think of any way around that, however.

Indeed, my benchmark shows the same.

> // We offset by 8 to make sure that each new privkey will meet the same
> // clamping criteria: we assume the keyspace is large enough that we're
> // unlikely to wrap around.

It looks like any offset may work, but I found that other offset values fail the clamping test much more often. It might be possible to make addition faster for special offset values (e.g. the identity point), but this likely won't make any difference given that BytesMontgomery dominates the time spent (96%).

[Screenshot: benchmark profile, 2025-01-11]

AlexanderYastrebov commented:

Hello, I found a way to speed up Montgomery bytes encoding using vector division; see AlexanderYastrebov/wireguard-vanity-key#3

The speedup is 7x and makes point addition dominate over Montgomery byte encoding:

[Screenshot: benchmark profile, 2025-02-05]
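A standard way to amortize the per-candidate field inversion in u = (1 + y)/(1 - y) is Montgomery's simultaneous-inversion ("batch inversion") trick: n inversions become one inversion plus roughly 3n multiplications. Whether the linked "vector division" PR does exactly this is an assumption on my part, but it is the usual technique, sketched here:

```python
# Montgomery's batch-inversion trick over the Curve25519 field.
# One real inversion (pow(..., -1, p)) serves the whole batch.

p = 2**255 - 19

def batch_inverse(xs):
    # prefix[i] = xs[0] * xs[1] * ... * xs[i] mod p
    prefix = []
    acc = 1
    for x in xs:
        acc = acc * x % p
        prefix.append(acc)
    inv = pow(acc, -1, p)              # the only field inversion
    out = [0] * len(xs)
    for i in range(len(xs) - 1, 0, -1):
        out[i] = inv * prefix[i - 1] % p   # peel off xs[i]'s inverse
        inv = inv * xs[i] % p              # inv now inverts prefix[i-1]
    out[0] = inv
    return out

xs = [3, 5, 7, 11, 2**200 + 1]
assert batch_inverse(xs) == [pow(x, -1, p) for x in xs]
```

Batching only helps when candidates can be buffered, e.g. converting a window of consecutive points before checking their base64 encodings, so the search loop structure has to change slightly to exploit it.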

Successfully merging this pull request may close: faster search algorithm.
5 participants