Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Thoughts on "Content-defined Merkle Trees for Efficient Container Delivery" (2021) #5539

Open
fenollp opened this issue Nov 20, 2024 · 1 comment

Comments

@fenollp
Copy link
Contributor

fenollp commented Nov 20, 2024

Today there is layer reuse while pushing or pulling images. However, layers may differ by just a few percent of their contents.
The article at https://arxiv.org/abs/2104.02158 describes a Merkle tree on the contents of each layer.

What's the team's thoughts on this potential technique?

I was thinking it might help with

@AkihiroSuda
Copy link
Member

AkihiroSuda commented Nov 21, 2024

Chunk-level dedupe is already supported with Nydus: https://github.com/moby/buildkit/blob/master/docs/nydus.md

However, Nydus isn't popular as its layer blobs are not compatible with OCI tar.gz layers.

There was a brainstorming to bring chunk-level dedupe to OCI "v2", but the discussion has got stuck: http://web.archive.org/web/20200919032824/https://hackmd.io/@cyphar/ociv2-brainstorm#Reduced-Duplication

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants