cache: don’t link blobonly based on chainid #3566

tonistiigi · 2023-02-01T20:17:29Z

Debugging #2631 and trying to understand how #3447 might fix this I found this case where we link to another snapshot that has the same chainID(uncompressed digest) but different blobID(compressed digest). In that case, we can mark the blob as !blobOnly while the ref is actually lazy. That does not look correct.

This case should be quite hard to hit, but maybe there are some layers with very common files that could make it more likely. @ohmer is testing if this is enough to fix the issue they are seeing.

Looking at @imeoer patch #3447 again it doesn't look very scary, so if this patch doesn't fully fix this, we could pick that as well, but I'm worried that we don't understand the actual issue, then and it will return in some other form.

PTAL @imeoer @sipsma

Signed-off-by: Tonis Tiigi tonistiigi@gmail.com

Signed-off-by: Tonis Tiigi <tonistiigi@gmail.com>

tonistiigi · 2023-02-08T21:00:12Z

@sipsma @imeoer Looks like testing was successful in #2631 (comment) . PTAL

sipsma

Approving because I see the issue here and can see how this fixes it, but have some concerns.

It seems like with this change it's possible for there to be a snapshot associated with the record but blobonly to still be true, which seems to contradict what I assume the purpose of blobonly was originally. If it works, it works, it just seems confusing.

Double checking my understanding: the fundamental problem is that the ref can be marked as not blobonly due to the existence of an equivalent snapshot but then when we later try to export such a ref the code presumes blobs for it must exist even though they don't due to different compression from the snapshot it dedupes with. Right?

In that case the patch from @imeoer makes more sense to me. That fixes the logic to determine whether a blob exists or not based on just checking the content store.

I actually feel like ideally it should be taken further; blobonly doesn't even need to exist in metadata and instead we could just check the content store everytime to determine whether it exists locally or not. It's the ultimate source of truth anyways.

That's a larger change of course, hard to do w/ back-compat and I'm probably not remembering details here so LGTM on this change for the short term either way.

tonistiigi · 2023-02-09T17:49:56Z

@sipsma Update this in follow-up if you have ideas.

ohmer · 2023-02-13T23:22:52Z

I have applied this change against v0.11.2 (https://github.com/moby/buildkit/compare/v0.11.2...sharesight:buildkit:fix-blobonly?expand=1) and started using it against our private repo. Issue has reappeared unfortunately.

Result differs from the tests I have done (described in #2631 (comment)). In the test bed, I have a references to public images only (https://github.com/sharesight/buildkit-debug/blob/pr-2/docker/Dockerfile). In our private repo, I have 2 branches where I see different results:

on our main branch, I have a Dockerfile with reference to private and public images (stored in ECR, if that matters).
on our test branch, I have a Dockerfile with reference only to public images (Docker Hub).

For both these branches, I am using the patched version linked above (same change as here). Result after 24h:

main branch: fails 10% of the time, 1 or 2 retry unblocked our pipelines
test branch: success rate is 100%

I store cache on S3 and call build-push action twice. First invocation: restore from S3, save to local directory. Second invocation: restore from local directory, save to S3. Workflow fails at second invocation (https://github.com/sharesight/buildkit-debug/blob/pr-2/.github/workflows/docker.yml#L96) with same error (Get error: content digest sha256: ***: not found when exporting image).

@tonistiigi would it be possible that this patch does not address both cases (private image vs no private images)?

cache: don’t link blobonly based on chainid

99bd0d8

Signed-off-by: Tonis Tiigi <tonistiigi@gmail.com>

crazy-max added the needs-cherry-pick/v0.11 label Feb 6, 2023

crazy-max added this to the v0.11.3 milestone Feb 6, 2023

tonistiigi marked this pull request as ready for review February 8, 2023 20:59

imeoer approved these changes Feb 9, 2023

View reviewed changes

sipsma approved these changes Feb 9, 2023

View reviewed changes

tonistiigi merged commit a196d7b into moby:master Feb 9, 2023

This was referenced Feb 10, 2023

Fix error: content digest sha256: ***: not found #3447

Closed

Get error: content digest sha256: ***: not found when exporting image #2631

Closed

tonistiigi mentioned this pull request Feb 13, 2023

[v0.11] cherry-picks for v0.11.3 #3634

Merged

crazy-max removed the needs-cherry-pick/v0.11 label Feb 13, 2023

pjonsson mentioned this pull request Apr 20, 2023

ERROR: content digest sha256:***: not found #3809

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cache: don’t link blobonly based on chainid #3566

cache: don’t link blobonly based on chainid #3566

tonistiigi commented Feb 1, 2023

tonistiigi commented Feb 8, 2023

sipsma left a comment

tonistiigi commented Feb 9, 2023

ohmer commented Feb 13, 2023 •

edited

Loading

cache: don’t link blobonly based on chainid #3566

cache: don’t link blobonly based on chainid #3566

Conversation

tonistiigi commented Feb 1, 2023

tonistiigi commented Feb 8, 2023

sipsma left a comment

Choose a reason for hiding this comment

tonistiigi commented Feb 9, 2023

ohmer commented Feb 13, 2023 • edited Loading

ohmer commented Feb 13, 2023 •

edited

Loading