Draft: Better performance for larger inputs #70

malvidin · 2022-11-30T04:05:59Z

Encoding and decoding should be faster for inputs of roughly 30 bytes and larger. It is much faster for 256-byte inputs, and should be able to handle inputs in the MB range.

keis · 2022-12-01T21:07:55Z

This is interesting for sure. But I don't think adding this complexity for the case of bigger payloads really makes sense.

keis · 2022-12-01T21:32:14Z

base58/__init__.py

-        result.append(mod)
-
-    return b'\0' * (origlen - newlen) + bytes(reversed(result))
+    return acc.to_bytes(origlen - newlen + (acc.bit_length() + 7) // 8, "big")


This change alone seems to have a pretty good impact and even simplifies the code 😍
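As a rough illustration of why the one-line replacement is equivalent to the removed loop, here is a minimal, self-contained sketch. The names `decode_int`, `decode_divmod`, and `decode_to_bytes` are hypothetical and not the library's API; the Bitcoin alphabet is assumed, and input validation is omitted.

```python
# Assumption: the Bitcoin base58 alphabet; the helper names are illustrative only.
ALPHABET = b"123456789ABCDEFGHJKLMNPQRSTUVWXYZabcdefghijkmnopqrstuvwxyz"
INDEX = {char: i for i, char in enumerate(ALPHABET)}


def decode_int(v: bytes) -> int:
    """Accumulate base58 digits into one integer, most significant first."""
    acc = 0
    for char in v:
        acc = acc * 58 + INDEX[char]
    return acc


def decode_divmod(v: bytes) -> bytes:
    """Old approach: peel off one byte at a time with divmod."""
    origlen = len(v)
    v = v.lstrip(ALPHABET[0:1])  # each leading "1" encodes one zero byte
    newlen = len(v)
    acc = decode_int(v)
    result = []
    while acc > 0:
        acc, mod = divmod(acc, 256)
        result.append(mod)
    return b"\0" * (origlen - newlen) + bytes(reversed(result))


def decode_to_bytes(v: bytes) -> bytes:
    """New approach: let int.to_bytes emit the bytes and the zero padding."""
    origlen = len(v)
    v = v.lstrip(ALPHABET[0:1])
    newlen = len(v)
    acc = decode_int(v)
    # (bit_length + 7) // 8 is the minimal byte count for acc; the stripped
    # leading characters are added back as leading zero bytes.
    return acc.to_bytes(origlen - newlen + (acc.bit_length() + 7) // 8, "big")
```

Both functions should return the same bytes for any valid input; the second simply moves the integer-to-bytes conversion out of a Python-level loop.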

malvidin · 2022-12-03T03:32:07Z

I am decoding base58 inputs that are ~200 characters long (~150 bytes) that I do not control, and want to mitigate the potential for performance issues if larger values are seen in the future.

The nested loop is O(n^2) and becomes closer to O(n^3) for large inputs. Using gmpy2 is consistently fast, but it is slower for inputs that are about half the size of a Bitcoin address.
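One way to make gmpy2 optional is an import fallback, sketched below. This is an assumption about the general pattern, not the code in this PR: `_Int` and `decode_int` are hypothetical names, and whether `mpz` actually helps for a given input size would need to be benchmarked.

```python
# Assumption: gmpy2 may or may not be installed; fall back to the built-in int.
try:
    from gmpy2 import mpz as _Int
except ImportError:
    _Int = int

ALPHABET = b"123456789ABCDEFGHJKLMNPQRSTUVWXYZabcdefghijkmnopqrstuvwxyz"
INDEX = {char: i for i, char in enumerate(ALPHABET)}


def decode_int(v: bytes) -> int:
    """Accumulate base58 digits using mpz when available, int otherwise."""
    acc = _Int(0)
    for char in v:
        # Each step multiplies the whole accumulator, so the loop does
        # O(n) big-integer operations on numbers that grow with n.
        acc = acc * 58 + INDEX[char]
    return int(acc)  # always hand back a plain Python int
```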

When gmpy2 is not used, splitting the inputs approximately in half on pre-computed powers of 2, 45, or 58 moves the cost closer to O(n^2*log(n)). By avoiding the Karatsuba cutoff for the nested loop, Karatsuba large-integer multiplications are performed in a divide-and-conquer manner.
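The splitting idea can be sketched roughly as follows for decoding. This is a simplified illustration, not the PR's implementation: `CUTOFF`, `_pow58`, and `decode_int_split` are hypothetical names, the threshold value is arbitrary, and only powers of 58 are used.

```python
ALPHABET = b"123456789ABCDEFGHJKLMNPQRSTUVWXYZabcdefghijkmnopqrstuvwxyz"
INDEX = {char: i for i, char in enumerate(ALPHABET)}
CUTOFF = 32  # hypothetical threshold below which the plain loop is used

_POW58 = {1: 58}  # cache of 58**k for k a power of two


def decode_int_loop(v: bytes) -> int:
    """Plain digit-by-digit accumulation, used for short chunks."""
    acc = 0
    for char in v:
        acc = acc * 58 + INDEX[char]
    return acc


def _pow58(k: int) -> int:
    """Return 58**k for k a power of two, computed by repeated squaring."""
    if k not in _POW58:
        half = _pow58(k // 2)
        _POW58[k] = half * half
    return _POW58[k]


def decode_int_split(v: bytes) -> int:
    """Divide-and-conquer conversion: value = high * 58**k + low."""
    n = len(v)
    if n <= CUTOFF:
        return decode_int_loop(v)
    # k is the largest power of two strictly below n, so the low part holds
    # exactly k digits and the cached power 58**k can be reused across calls.
    k = 1 << ((n - 1).bit_length() - 1)
    high = decode_int_split(v[: n - k])
    low = decode_int_split(v[n - k :])
    return high * _pow58(k) + low
```

The point of the split is that most of the work becomes a few multiplications of large, similarly sized integers, where CPython can apply Karatsuba, instead of many multiplications of a large integer by 58.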

The change adds complexity, but it provides better performance for inputs up to ~2MB.

keis reviewed Dec 1, 2022

malvidin added 2 commits December 5, 2022 13:27

1c214fd  Add faster base58 encode/decode
    Add optional gmpy2.mpz for even faster encode/decode
    Add longer random benchmark

66adc8d  Cache translate bytes instead of mapping
    Apply Black code style
    Quiet mypy errors

malvidin force-pushed the fast_encode branch from d6b6a65 to 66adc8d Compare December 5, 2022 12:40

keis mentioned this pull request Dec 11, 2022

Use int.to_bytes to speed up decoding #73

Merged
