🐛 CAPD: delete container after failed start to work around port allocation issues #9125

chrischdi · 2023-08-04T16:52:59Z

What this PR does / why we need it:

Deletes the container after a failed ContainerStart to improve the retry later on by going again by creating a new container.

Which issue(s) this PR fixes (optional, in fixes #<issue number>(, fixes #<issue_number>, ...) format, will close the issue(s) when PR gets merged):
Fixes #8824

…ion issues

killianmuldoon

Thanks!

/lgtm

k8s-ci-robot · 2023-08-04T16:56:47Z

LGTM label has been added.

Git tree hash: 4fb62dc082a474992ef947e94cf76d4349ebf765

killianmuldoon · 2023-08-04T18:45:22Z

/area provider/infrastructure-docker

furkatgofurov7

/lgtm

killianmuldoon · 2023-08-07T11:42:19Z

/approve

k8s-ci-robot · 2023-08-07T11:42:27Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: killianmuldoon

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [killianmuldoon]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

killianmuldoon · 2023-08-07T11:42:32Z

/cherry-pick release-1.5

k8s-infra-cherrypick-robot · 2023-08-07T11:42:33Z

@killianmuldoon: once the present PR merges, I will cherry-pick it on top of release-1.5 in a new PR and assign it to you.

In response to this:

/cherry-pick release-1.5

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

killianmuldoon · 2023-08-07T11:42:35Z

/cherry-pick release-1.4

k8s-infra-cherrypick-robot · 2023-08-07T11:42:36Z

@killianmuldoon: once the present PR merges, I will cherry-pick it on top of release-1.4 in a new PR and assign it to you.

In response to this:

/cherry-pick release-1.4

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

killianmuldoon · 2023-08-07T11:42:57Z

I'll put a hold on the cherry-picks until we get some sort of signal that this is helping the flake.

k8s-infra-cherrypick-robot · 2023-08-07T11:57:12Z

@killianmuldoon: new pull request created: #9130

In response to this:

/cherry-pick release-1.5

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

k8s-infra-cherrypick-robot · 2023-08-07T11:57:48Z

@killianmuldoon: new pull request created: #9131

In response to this:

/cherry-pick release-1.4

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

sbueringer · 2023-08-07T15:01:13Z

test/infrastructure/container/docker.go

+		// Delete the container and retry later on. This helps getting around the race
+		// condition where of hitting "port is already allocated" issues.
+		if innerErr := d.dockerClient.ContainerRemove(ctx, resp.ID, types.ContainerRemoveOptions{Force: true, RemoveVolumes: true}); innerErr != nil {
+			return errors.Wrapf(innerErr, "error removing container after failed start: %s", err)


Not sure if that mixes the errors in a way that is not easily readable. I think ideally we would use kerrors aggregate (and usually do)

In this case it should result in:

{InnerErr}: error removing container after failed start: {err}

Ack, kerrors aggregate would have been an option I didn't think of.

Was also thinking about using go's errors.Join.

I think this is not how error wrapping works. As far as I know it appends the innerErr at the end

(just like the normal fmt.Errorf("adfasfd test: %v", err) would)

Usually we use something like this in CAPI:

reterr = kerrors.NewAggregate([]error{reterr, errors.New("failed to unlock the kubeadm init lock")})

Ah yeah so currently it is:

error removing container after failed start: {err}: {innerErr}

I will follow up and use kerrors instead 👍

Thx! I know it's a nit in a test provider, just noticed and was thinking about future me trying to parse the error :)

CAPD: delete container after failed start to work around port allocat…

fb74bf8

…ion issues

k8s-ci-robot added cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Aug 4, 2023

k8s-ci-robot requested review from elmiko and jackfrancis August 4, 2023 16:53

killianmuldoon reviewed Aug 4, 2023

View reviewed changes

k8s-ci-robot assigned killianmuldoon Aug 4, 2023

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 4, 2023

k8s-ci-robot added the area/provider/infrastructure-docker Issues or PRs related to the docker infrastructure provider label Aug 4, 2023

furkatgofurov7 reviewed Aug 4, 2023

View reviewed changes

k8s-ci-robot assigned furkatgofurov7 Aug 4, 2023

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Aug 7, 2023

k8s-ci-robot merged commit bda002f into kubernetes-sigs:main Aug 7, 2023

k8s-ci-robot added this to the v1.6 milestone Aug 7, 2023

k8s-infra-cherrypick-robot mentioned this pull request Aug 7, 2023

[release-1.5] 🐛 CAPD: delete container after failed start to work around port allocation issues #9130

Merged

k8s-infra-cherrypick-robot mentioned this pull request Aug 7, 2023

[release-1.4] 🐛 CAPD: delete container after failed start to work around port allocation issues #9131

Merged

sbueringer reviewed Aug 7, 2023

View reviewed changes

chrischdi mentioned this pull request Aug 7, 2023

🌱 CAPD: fix multi error handling in RunContainer #9139

Merged

chrischdi deleted the pr-docker-fix-flaky-containercreate branch August 18, 2023 10:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🐛 CAPD: delete container after failed start to work around port allocation issues #9125

🐛 CAPD: delete container after failed start to work around port allocation issues #9125

chrischdi commented Aug 4, 2023

killianmuldoon left a comment

k8s-ci-robot commented Aug 4, 2023

killianmuldoon commented Aug 4, 2023

furkatgofurov7 left a comment

killianmuldoon commented Aug 7, 2023

k8s-ci-robot commented Aug 7, 2023

killianmuldoon commented Aug 7, 2023

k8s-infra-cherrypick-robot commented Aug 7, 2023

killianmuldoon commented Aug 7, 2023

k8s-infra-cherrypick-robot commented Aug 7, 2023

killianmuldoon commented Aug 7, 2023

k8s-infra-cherrypick-robot commented Aug 7, 2023

k8s-infra-cherrypick-robot commented Aug 7, 2023

sbueringer Aug 7, 2023

chrischdi Aug 7, 2023

sbueringer Aug 7, 2023

sbueringer Aug 7, 2023

sbueringer Aug 7, 2023

chrischdi Aug 7, 2023

sbueringer Aug 7, 2023

🐛 CAPD: delete container after failed start to work around port allocation issues #9125

🐛 CAPD: delete container after failed start to work around port allocation issues #9125

Conversation

chrischdi commented Aug 4, 2023

killianmuldoon left a comment

Choose a reason for hiding this comment

k8s-ci-robot commented Aug 4, 2023

killianmuldoon commented Aug 4, 2023

furkatgofurov7 left a comment

Choose a reason for hiding this comment

killianmuldoon commented Aug 7, 2023

k8s-ci-robot commented Aug 7, 2023

killianmuldoon commented Aug 7, 2023

k8s-infra-cherrypick-robot commented Aug 7, 2023

killianmuldoon commented Aug 7, 2023

k8s-infra-cherrypick-robot commented Aug 7, 2023

killianmuldoon commented Aug 7, 2023

k8s-infra-cherrypick-robot commented Aug 7, 2023

k8s-infra-cherrypick-robot commented Aug 7, 2023

sbueringer Aug 7, 2023

Choose a reason for hiding this comment

chrischdi Aug 7, 2023

Choose a reason for hiding this comment

sbueringer Aug 7, 2023

Choose a reason for hiding this comment

sbueringer Aug 7, 2023

Choose a reason for hiding this comment

sbueringer Aug 7, 2023

Choose a reason for hiding this comment

chrischdi Aug 7, 2023

Choose a reason for hiding this comment

sbueringer Aug 7, 2023

Choose a reason for hiding this comment