TLV JS: Introduce JS library for TLV #5184

wjthieme · 2023-09-01T09:00:00Z

resolves #5127

In this PR I have isolated parsing of TLVData into its own file. For ticket #5127, I think this is fine for now as this allows us to very quickly extract TLVData parsing into its own library if we ever need it.

The TLVData class adds two helper functions for reading TLVData.

entry(type) returns the first entry where the type matches the supplied type (preferred)
entries() returns a map of all entries in the TLVData.

Parsing still succeeds if the tlv data is not the right length. entry just returns null for a partial last entry and entries just excludes the last partial entry.

The TLVData class allows you to specify the length/span of both the type and length part of the tlv which should make parsing tlv data much easier in the future.
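
To make the lenient behaviour described above concrete, here is a minimal sketch of a parser with configurable type and length widths that drops a partial trailing entry. It is not the code from this PR: `FieldSize`, `TlvEntry`, `readUInt` and `parseEntriesLenient` are hypothetical names, little-endian fields are assumed, and it works on `Uint8Array` (which `Buffer` extends) so it needs no Node-specific types.

```typescript
// Hypothetical names throughout; this is not the PR's TLVData class.
type FieldSize = 1 | 2 | 4;

interface TlvEntry {
    type: number;
    value: Uint8Array;
}

// Read an unsigned little-endian integer of the given width.
function readUInt(view: DataView, offset: number, size: FieldSize): number {
    switch (size) {
        case 1:
            return view.getUint8(offset);
        case 2:
            return view.getUint16(offset, true);
        case 4:
            return view.getUint32(offset, true);
    }
}

// Lenient parse: a truncated trailing entry is silently dropped.
function parseEntriesLenient(bytes: Uint8Array, typeSize: FieldSize, lengthSize: FieldSize): TlvEntry[] {
    const view = new DataView(bytes.buffer, bytes.byteOffset, bytes.byteLength);
    const entries: TlvEntry[] = [];
    let offset = 0;
    while (offset + typeSize + lengthSize <= bytes.length) {
        const type = readUInt(view, offset, typeSize);
        const length = readUInt(view, offset + typeSize, lengthSize);
        const valueStart = offset + typeSize + lengthSize;
        const valueEnd = valueStart + length;
        if (valueEnd > bytes.length) break; // partial last entry: stop here
        entries.push({ type, value: bytes.subarray(valueStart, valueEnd) });
        offset = valueEnd;
    }
    return entries;
}

// One complete entry (type 1, two value bytes), then an entry that
// declares four value bytes but only carries one.
const sampleBytes = Uint8Array.from([1, 0, 2, 0, 0xaa, 0xbb, 2, 0, 4, 0, 0xcc]);
const parsedEntries = parseEntriesLenient(sampleBytes, 2, 2);
console.log(parsedEntries.length); // 1
```

The truncated second entry is simply skipped, which is the behaviour this description refers to.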

I added a couple simple tests for parsing and reading TLVData.

This new class is not actually being used yet anywhere in the code. I plan to create (at least) two follow up PRs that switch out the old implementation for this new one once this PR is merged:

getExtensionData in ./token/js/extension/extensionType.ts
getExtraAccountMetaAccount in ./token/js/extension/transferHook/state.ts
Anything else?

buffalojoec

This is definitely heading in the right direction. Thanks again for continuing to contribute here!

I will say I think it's worth matching a little bit more of the behavioral aspects of the Rust libraries, namely:

Parsing still succeeds if the tlv data is not the right length. entry just returns null for a partial last entry and entries just excludes the last partial entry.

In Rust, this would fail to parse, so we probably want this to happen in JS as well. We might even need to introduce some errors similar to the Rust ones.

solana-program-library/libraries/type-length-value/src/state.rs

Lines 143 to 145 in f35dc5f

    
    if tlv_data.len() < value_end {
        return Err(ProgramError::InvalidAccountData);
    }
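
A JS counterpart of this bounds check could look roughly like the sketch below. The error class `TlvInvalidAccountDataError` and the function `readValueStrict` are hypothetical names chosen for illustration, and the 2-byte discriminator and 2-byte little-endian length are assumed sizes, not something this thread has settled on.

```typescript
// Hypothetical error type, loosely mirroring ProgramError::InvalidAccountData.
class TlvInvalidAccountDataError extends Error {
    constructor(message = 'TLV data is shorter than its declared length') {
        super(message);
        this.name = 'TlvInvalidAccountDataError';
    }
}

// Returns the value of the entry that starts at `offset`.
// Assumed layout: 2-byte discriminator, 2-byte little-endian length, value.
function readValueStrict(tlvData: Uint8Array, offset: number): Uint8Array {
    const headerEnd = offset + 4;
    if (tlvData.length < headerEnd) {
        throw new TlvInvalidAccountDataError();
    }
    const view = new DataView(tlvData.buffer, tlvData.byteOffset, tlvData.byteLength);
    const length = view.getUint16(offset + 2, true);
    const valueEnd = headerEnd + length;
    // Same condition as the Rust check: fail instead of returning a partial entry.
    if (tlvData.length < valueEnd) {
        throw new TlvInvalidAccountDataError();
    }
    return tlvData.subarray(headerEnd, valueEnd);
}
```

Throwing a dedicated error type lets callers distinguish malformed data from an entry that is simply absent.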

I also think we should try to use similar nomenclature for the class and method names, to avoid confusion, ie: TlvState, get_first_bytes, etc. Obviously we don't need the separated mut API in here, though.

buffalojoec · 2023-09-07T03:03:56Z

token/js/src/extensions/tlvData.ts

+function readTLVNumberSize<T>(
+    buffer: Buffer,
+    size: TLVNumberSize,
+    offset: number,
+    constructor: (x: number | bigint) => T
+): T {
+    switch (size) {
+        case 1:
+            return constructor(buffer.readUInt8(offset));
+        case 2:
+            return constructor(buffer.readUInt16LE(offset));
+        case 4:
+            return constructor(buffer.readUInt32LE(offset));
+        case 8:
+            return constructor(buffer.readBigUInt64LE(offset));
+    }
+}


This is a neat function here, but our TLV library is using u32 for length and Token2022 is using u16, so we can probably do away with handling u64 and eliminate the need for bigint. What do you think?

solana-program-library/token/program-2022/src/extension/mod.rs

Line 81 in f35dc5f

pub struct Length(PodU16);

solana-program-library/libraries/type-length-value/src/length.rs

Line 11 in f35dc5f

pub struct Length(PodU32);

But the discriminator could still be 8 bytes, right? This function is currently used for parsing both the discriminator and the length.

Yes, but it's not a u64, it's just an array, so it doesn't need to be represented as a bigint.

Yep that makes sense. This was only added because you can then supply a BigInt or number in the entry function as well as a Buffer.
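
The outcome of this exchange can be sketched as follows: lengths are read as u16 or u32 into a plain `number`, and discriminators of any width are compared as raw bytes, so no `bigint` is involved. `LengthSize`, `readLength` and `discriminatorMatches` are hypothetical names for illustration, and little-endian encoding is assumed.

```typescript
// u16 (as in Token2022) or u32 (as in the Rust TLV library); both fit in a number.
type LengthSize = 2 | 4;

function readLength(bytes: Uint8Array, offset: number, size: LengthSize): number {
    const view = new DataView(bytes.buffer, bytes.byteOffset, bytes.byteLength);
    return size === 2 ? view.getUint16(offset, true) : view.getUint32(offset, true);
}

// Discriminators are treated as byte arrays, so an 8-byte discriminator
// never has to be converted to a bigint.
function discriminatorMatches(bytes: Uint8Array, offset: number, discriminator: Uint8Array): boolean {
    if (offset + discriminator.length > bytes.length) return false;
    return discriminator.every((byte, i) => bytes[offset + i] === byte);
}
```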

token/js/src/extensions/tlvData.ts

token/js/test/unit/tlvData.test.ts

buffalojoec · 2023-09-07T03:29:15Z

I can see that you're going for a more generalized approach here, and I'm not sure this idea I'm going to suggest will be as slick as I imagine it, but what do you think about instead using an interface and/or super class that together mock the TlvState trait? Ideally then you'd just supply the sizes and get all of the TLV stuff out of the box?

That's what a lot of these tests are doing; just creating objects that implement SplDiscriminate and can then be used as TLV entries.

In JS, I understand we would have to somehow replicate Rust's size_of or the solana_program::borsh::get_instance_packed_len functions, but that could be roped in here.

wjthieme · 2023-09-07T07:36:11Z

In Rust, this would fail to parse, so we probably want this to happen in JS as well. We might even need to introduce some errors similar to the Rust ones.

Agreed. Let's keep it similar to the rust implementation. Also think it makes sense to keep the naming the same. i.e. TlvState instead of TLVData, get_first_bytes and get_bytes_with_repetition and maybe get_discriminators which would then replace the existing functions.
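
As a rough illustration of how get_first_bytes and get_bytes_with_repetition could relate in JS, here is a sketch. The camelCase names, the zero-based `repetition` index, the u32 little-endian length, and returning `null` when nothing matches are all assumptions made for this example, not the API that was merged.

```typescript
// Hypothetical helpers; sizes and names are assumptions for illustration.
const LENGTH_SIZE = 4; // assumed u32 little-endian length

function getBytesWithRepetition(
    tlvData: Uint8Array,
    discriminator: Uint8Array,
    repetition: number
): Uint8Array | null {
    const view = new DataView(tlvData.buffer, tlvData.byteOffset, tlvData.byteLength);
    const headerSize = discriminator.length + LENGTH_SIZE;
    let offset = 0;
    let seen = 0;
    // Trailing bytes shorter than one header are ignored in this sketch.
    while (offset + headerSize <= tlvData.length) {
        const entryStart = offset;
        const valueStart = entryStart + headerSize;
        const valueEnd = valueStart + view.getUint32(entryStart + discriminator.length, true);
        if (valueEnd > tlvData.length) {
            throw new Error('Invalid TLV data: entry extends past the end of the buffer');
        }
        const matches = discriminator.every((byte, i) => tlvData[entryStart + i] === byte);
        if (matches) {
            if (seen === repetition) return tlvData.subarray(valueStart, valueEnd);
            seen++;
        }
        offset = valueEnd;
    }
    return null; // choice of this sketch: "not found" is reported as null
}

// The first entry is just repetition 0.
function getFirstBytes(tlvData: Uint8Array, discriminator: Uint8Array): Uint8Array | null {
    return getBytesWithRepetition(tlvData, discriminator, 0);
}
```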

I can see that you're going for a more generalized approach here, and I'm not sure this idea I'm going to suggest will be as slick as I imagine it, but what do you think about instead using an interface and/or super class that together mock the TlvState trait? Ideally then you'd just supply the sizes and get all of the TLV stuff out of the box?

I think it would definitely make sense to add some form of spl_discriminate to the JS library as well, mainly because it makes the code easier to read and verify: you can just match the discriminator string against the Rust code instead of having to take bytes/bigint/number, turn that into raw bytes, and then try to find the matching discriminator.

I think probably the easiest (and fastest) implementation could be a single function that takes a discriminator string and turns it into bytes/bigint/number which you can then throw into the TlvState to grab the entry. Something like:

import { createHash } from 'crypto';

export function splDiscriminator(key: string, length = 8): Buffer {
    // TODO: make sure this also works in browser without needing to polyfill node crypto/buffer?
    const digest = createHash('sha256').update(key).digest();
    return digest.subarray(0, length);
}
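
A short usage sketch of a helper like this one, restated so the block runs on its own under Node. The key `'example-namespace:example-instruction'` is a made-up string for illustration, not a real discriminator namespace.

```typescript
import { createHash } from 'crypto';

// Same idea as the helper above: the first `length` bytes of sha256(key).
function splDiscriminator(key: string, length = 8): Buffer {
    const digest = createHash('sha256').update(key).digest();
    return digest.subarray(0, length);
}

// Made-up key, used only to show the shape of the result.
const fullDiscriminator = splDiscriminator('example-namespace:example-instruction');
const shortDiscriminator = splDiscriminator('example-namespace:example-instruction', 2);

console.log(fullDiscriminator.length, shortDiscriminator.length); // 8 2
// A shorter discriminator is a prefix of the longer one for the same key.
console.log(fullDiscriminator.subarray(0, 2).equals(shortDiscriminator)); // true
```

Because a shorter discriminator is just a prefix of the same hash, one helper can serve callers that use different discriminator lengths.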

buffalojoec · 2023-09-12T11:40:44Z

Ok I think we managed to fix the CI!

Let me know when you're ready for another review

buffalojoec · 2023-09-12T11:52:27Z

I think it would definitely make sense to add some form of spl_discriminate to the JS library as well, mainly because it makes the code easier to read and verify: you can just match the discriminator string against the Rust code instead of having to take bytes/bigint/number, turn that into raw bytes, and then try to find the matching discriminator.

I think probably the easiest (and fastest) implementation could be a single function that takes a discriminator string and turns it into bytes/bigint/number which you can then throw into the TlvState to grab the entry. Something like:

import { createHash } from 'crypto';

export function splDiscriminator(key: string, length = 8): Buffer {
    // TODO: make sure this also works in browser without needing to polyfill node crypto/buffer?
    const digest = createHash('sha256').update(key).digest();
    return digest.subarray(0, length);
}

Yes, I like this approach! I think that's perfect. The more I think about it, the more a generalized approach like the one you've written makes sense because, as I mentioned above, we actually want to avoid what we did in Rust for Token2022 and TLV, and instead share all TLV operations across this library.

If Token2022 wasn't being audited, I'd vote to do this in Rust as well using the Rust TLV lib, and modify the Rust spl-discriminator library to support whatever length discriminator you want. This may happen in the future, so what you've got for any length discriminator makes sense to me!

buffalojoec · 2023-09-12T11:57:23Z

Can we roll a separate library for this stuff, rather than working it into Token JS?

I think after some of our discussion it makes sense to introduce new ones, and then they can be imported into the metadata library as well as token (#5228).

I think, for now, let's just keep SplDiscriminate and TLV stuff in one JS library - we'll call it @solana/spl-type-length-value, and then import it where we need it. What do you think?

We could prob add a folder to the type-length-value library for js

wjthieme · 2023-09-12T12:04:31Z

Can we roll a separate library for this stuff, rather than working it into Token JS?

I think that makes sense. What location in the repository would you suggest for that?

There is the libraries folder but it currently only contains rust libraries so wouldn't necessarily say it makes sense to put it there.

I'm leaning towards either a type-length-value or js-type-length-value in the root or something along the lines of libraries-js/type-length-value. What do you think?

buffalojoec · 2023-09-12T13:12:01Z

There is the libraries folder but it currently only contains rust libraries so wouldn't necessarily say it makes sense to put it there.

You're right but that's only because none of those libraries have any JS counterparts 😂

I'd say just stick a js folder into libraries/type-length-value. Cargo won't even look at it

buffalojoec

This is coming together nicely! I left comments mostly around simplifying the provided discriminator(s). I think if we can strip some of that complexity away, we'll be in good shape.

Thanks again!

.github/workflows/pull-request-libraries.yml

libraries/type-length-value/js/package.json

libraries/type-length-value/js/README.md

libraries/type-length-value/js/src/splDiscriminate.ts

libraries/type-length-value/js/src/tlvState.ts

buffalojoec

Lgtm. Thanks for putting this together and addressing all feedback!

Added a TLVData parsing helper class

005cf6b

mergify bot added the community Community contribution label Sep 1, 2023

wjthieme changed the title ~~Added a TLVData parsing helper class~~ token-js: added a helper class for parsing tlv-data Sep 1, 2023

buffalojoec self-requested a review September 7, 2023 00:59

buffalojoec reviewed Sep 7, 2023

Matched naming to rust library and added an SplDiscriminator helper c…

60838b1

…lass

Empty to retrigger CI

24afa21

wjthieme added 2 commits September 12, 2023 17:51

Moved tlv-js libary into its own folder and added ci

54967b1

Added ts node dev dependency to tlv js library

02c009b

wjthieme requested a review from buffalojoec September 12, 2023 16:00

buffalojoec reviewed Sep 12, 2023

wjthieme added 2 commits September 12, 2023 19:37

Simplify some of the components in TlvState

68dfef5

Run prettier

defa88e

buffalojoec changed the title ~~token-js: added a helper class for parsing tlv-data~~ TLV JS: Introduce JS library for TLV Sep 12, 2023

buffalojoec approved these changes Sep 12, 2023

buffalojoec merged commit a399681 into solana-labs:master Sep 13, 2023
8 checks passed

wjthieme deleted the wjthieme-patch-5 branch September 13, 2023 13:55
