Fix bug in extracting hardlinks #284

priyawadhwa · 2018-08-10T20:10:05Z

Should fix the issues brought up in #282

dlorenc · 2018-08-10T20:13:29Z

pkg/util/fs_util.go

@@ -247,6 +287,15 @@ func checkWhiteouts(path string, whiteouts map[string]struct{}) bool {
 	return false
 }

+func checkSymlinks(path string, symlinks map[string]struct{}) bool {


I think I'm not sure why we need this one if we're checking the other stuff. Are you sure we need it?

You're right, thanks for pointing that out! I'll remove it.

I think we might actually need the symlink check, since we only want to specially hardlink at the end of extraction if the linked file exists but can't be hardlinked (which I think only happens if it's a symlink)

If a file was normally extracted in a previous layer than we can extract the file and create the hardlink normally

vito · 2018-08-13T14:45:36Z

pkg/util/fs_util.go

+					}
+					hardlinks[linkname] = &hardlink{
+						links:  []*tar.Header{hdr},
+						reader: tr,


Hmm, I'd be kinda surprised if saving off the tar reader here works later. Per the docs for tr.Next:

Next advances to the next entry in the tar archive. The Header.Size determines how many bytes can be read for the next file. Any remaining data in the current file is automatically discarded.

(emphasis mine)

One problem is that it's basically streaming through the archvie sequentially as it's downloading. It never buffers to disk or RAM, so I don't think you can go "back" to earlier entries in the archive. Tricky.

Thanks for pointing this out, fixed.

priyawadhwa · 2018-08-14T23:12:58Z

Woot tests are finally passing, PTAL!

dlorenc · 2018-08-16T16:58:23Z

pkg/util/fs_util.go

@@ -55,6 +62,10 @@ func GetFSFromImage(root string, img v1.Image) error {

 	fs := map[string]struct{}{}
 	whiteouts := map[string]struct{}{}
+	hardlinks, err := retrieveHardlinks(layers)


Do you think it would make sense to process each layer separately instead of doing them all at once? I think we already disallow hardlinks to cross layer boundaries.

Yah I think that would be better, I changed it to retrieve hardlinks as we process layers

dlorenc · 2018-08-23T20:56:44Z

I think this needs a rebase now.

priyawadhwa · 2018-08-24T00:08:23Z

Done! :)

Extracting the layers of the filesystem in order will make it easier to extract cached layers and deal with hardlinks, as mentioned in GoogleContainerTools#284 This PR implements extracting in order and adds an integration tests to test the bug hardlinks error in GoogleContainerTools#284 It also fixes GoogleContainerTools#325

priyawadhwa · 2018-08-30T17:49:32Z

Closing, fixed by #326

fix bug in extracting hardlinks

3d128d3

dlorenc reviewed Aug 10, 2018

View reviewed changes

remove symlinks and fix hardlink unit test

2808197

priyawadhwa force-pushed the hardlink branch from d2e3f6a to 2808197 Compare August 10, 2018 23:19

fixing integrationt tests

881708b

vito reviewed Aug 13, 2018

View reviewed changes

WIP

84787f4

priyawadhwa force-pushed the hardlink branch 5 times, most recently from e00f0d9 to 3dc172a Compare August 14, 2018 21:30

Go through layers twice to resolve hardlinks

69b9e92

priyawadhwa force-pushed the hardlink branch from 3dc172a to 69b9e92 Compare August 14, 2018 22:00

priyawadhwa changed the title ~~[WIP] Fix bug in extracting hardlinks~~ Fix bug in extracting hardlinks Aug 14, 2018

Removed unnecessary check

ff09a44

dlorenc reviewed Aug 16, 2018

View reviewed changes

Get hardlinks per layer

d013c30

Rebased on master

5ac7822

container-tools-bot added the size/L label Aug 23, 2018

Rebased against master

28766cd

priyawadhwa closed this Aug 30, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix bug in extracting hardlinks #284

Fix bug in extracting hardlinks #284

priyawadhwa commented Aug 10, 2018

dlorenc Aug 10, 2018

priyawadhwa Aug 10, 2018

priyawadhwa Aug 13, 2018

vito Aug 13, 2018

priyawadhwa Aug 14, 2018

priyawadhwa commented Aug 14, 2018

dlorenc Aug 16, 2018

priyawadhwa Aug 17, 2018

dlorenc commented Aug 23, 2018

priyawadhwa commented Aug 24, 2018

priyawadhwa commented Aug 30, 2018

Fix bug in extracting hardlinks #284

Fix bug in extracting hardlinks #284

Conversation

priyawadhwa commented Aug 10, 2018

dlorenc Aug 10, 2018

Choose a reason for hiding this comment

priyawadhwa Aug 10, 2018

Choose a reason for hiding this comment

priyawadhwa Aug 13, 2018

Choose a reason for hiding this comment

vito Aug 13, 2018

Choose a reason for hiding this comment

priyawadhwa Aug 14, 2018

Choose a reason for hiding this comment

priyawadhwa commented Aug 14, 2018

dlorenc Aug 16, 2018

Choose a reason for hiding this comment

priyawadhwa Aug 17, 2018

Choose a reason for hiding this comment

dlorenc commented Aug 23, 2018

priyawadhwa commented Aug 24, 2018

priyawadhwa commented Aug 30, 2018