Add tar/asm.IterateHeaders #71

mtrmac · 2024-09-11T18:03:12Z

This allows reading the metadata contained in tar-split without expensively recreating the whole tar stream including full contents.

We have two use cases for this:

In a situation where tar-split is distributed along with a separate metadata stream, ensuring that the two are exactly consistent
Reading the tar headers allows making a ~cheap check of consistency of on-disk layers, just checking that the files exist in expected sizes, without reading the full contents.

This can be implemented outside of this repo, but it's not ideal:

The function necessarily hard-codes some assumptions about how tar-split determines the boundaries of SegmentType/FileType entries (or, indeed, whether it uses FileType entries at all). That's best maintained directly beside the code that creates this.
The ExpectedPadding() value is not currently exported, so the consumer would have to heuristically guess where the padding ends.

Cc: @kwilczynski

This allows reading the metadata contained in tar-split without expensively recreating the whole tar stream including full contents. We have two use cases for this: - In a situation where tar-split is distributed along with a separate metadata stream, ensuring that the two are exactly consistent - Reading the tar headers allows making a ~cheap check of consistency of on-disk layers, just checking that the files exist in expected sizes, without reading the full contents. This can be implemented outside of this repo, but it's not ideal: - The function necessarily hard-codes some assumptions about how tar-split determines the boundaries of SegmentType/FileType entries (or, indeed, whether it uses FileType entries at all). That's best maintained directly beside the code that creates this. - The ExpectedPadding() value is not currently exported, so the consumer would have to heuristically guess where the padding ends. Signed-off-by: Miloslav Trmač <[email protected]>

kwilczynski · 2024-09-12T21:16:12Z

@mtrmac, this is very nice! Thank you for exposing this!

There was a use case I had where exposing FileType and Payload (so the CRC64) would be useful. But I don't know if this is something we would like to do and what the complexity of this would be.

kwilczynski · 2024-09-12T21:16:18Z

/approve
/lgtm

vbatts

Interesting use-case. Thanks for that.

vbatts · 2024-09-27T00:19:53Z

and i've tagged release v0.11.6

mtrmac · 2024-09-27T17:39:15Z

@vbatts Thanks!

mtrmac · 2024-09-27T17:57:46Z

There was a use case I had where exposing FileType and Payload (so the CRC64) would be useful. But I don't know if this is something we would like to do and what the complexity of this would be.

The current code simply ignores FileType, but it already assumes that there is only one tar header per SegmentType; so changing that to expect the two to interleave regularly, and to collect the other data, seems easy enough to do.

This was referenced Sep 11, 2024

Ensure chunked TOC and tar-split metadata are consistent containers/storage#2035

Merged

Zstd(:chunked) work tracking checklist containers/image#2189

Open

vbatts approved these changes Sep 26, 2024

View reviewed changes

vbatts merged commit 93a41cf into vbatts:main Sep 26, 2024
5 checks passed

mtrmac deleted the iterate branch September 27, 2024 17:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add tar/asm.IterateHeaders #71

Add tar/asm.IterateHeaders #71

mtrmac commented Sep 11, 2024 •

edited

Loading

kwilczynski commented Sep 12, 2024

kwilczynski commented Sep 12, 2024

vbatts left a comment

vbatts commented Sep 27, 2024

mtrmac commented Sep 27, 2024

mtrmac commented Sep 27, 2024

Add tar/asm.IterateHeaders #71

Add tar/asm.IterateHeaders #71

Conversation

mtrmac commented Sep 11, 2024 • edited Loading

kwilczynski commented Sep 12, 2024

kwilczynski commented Sep 12, 2024

vbatts left a comment

Choose a reason for hiding this comment

vbatts commented Sep 27, 2024

mtrmac commented Sep 27, 2024

mtrmac commented Sep 27, 2024

mtrmac commented Sep 11, 2024 •

edited

Loading