Return success from CNI DEL when IPAMD is unreachable #2350

jdn5126 · 2023-04-11T19:43:14Z

What type of PR is this?
bug

Which issue does this PR fix:
#2331

What does this PR do / Why do we need it:
This PR modifies the VPC CNI pod deletion logic to return no error to the container-runtime when IPAMD is unreachable. This is needed for three primary reasons:

The CNI spec specifies that delete operations should generally complete without error, even when resources are missing.
The CNI cannot count on kubelet to retry deletes until IPAMD is available. If a user deletes a pod with a termination grace period of 0 seconds, the pod is deleted from the API server before kubelet returns success, so kubelet will give up after a few iterations.
If the aws-node daemonset pod cannot be scheduled until other pods are deleted, pod deletion needs to happen without IPAMD.

CNI Changes:
As before, the CNI will try to delete non-branch-ENI pods using PrevResult when IPAMD is not reachable. The difference is that CNI will now return no error in this case. Note that PrevResult will only be available for pods created with VPC CNI v1.12.1+. For pods created before this, IP rules will be leaked, and it is IPAMD's responsibility to cleanup these rules when it starts again.

IPAMD Changes:
On init, IPAMD identifies stale allocations and prunes IP rules for these stale allocations. IPAMD cannot be sure if CNI was able to prune those rules before the pod was deleted. Also, IPAMD does an additional state file write after pruning entries.

Misc:
I also did some refactoring of network utilities to avoid code duplication.

If an issue # is not available please add repro steps and logs from IPAMD/CNI showing the issue:
N/A

Testing done on this change:
Added unit test coverage for stale IPAM allocations. Manually created a cluster and verified behavior when IPAMD is unreachable, plus upgrade/downgrade scenarios.

Automation added to e2e:
N/A

Will this PR introduce any new dependencies?:
N/A

Will this break upgrades or downgrades. Has updating a running cluster been tested?:
No, Yes

Does this change require updates to the CNI daemonset config files to work?:
No

Does this PR introduce any user-facing change?:
Yes

Update CNI to delete pod when IPAMD is unreachable.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

pkg/networkutils/network.go

cmd/routed-eni-cni-plugin/cni.go

pkg/ipamd/datastore/data_store.go

jayanthvn · 2023-04-24T19:13:21Z

pkg/ipamd/datastore/data_store.go

 	for _, allocation := range checkpoint.Allocations {
 		if err := ds.validateAllocationByPodVethExistence(allocation, hostNSLinks); err != nil {
-			ds.log.Warnf("ignore IP allocation for %v:%v,%v due to %v", allocation.ContainerID, allocation.IPv4, allocation.IPv6, err)
+			ds.log.Warnf("stale IP allocation for ID(%v): IPv4(%v), IPv6(%v) due to %v", allocation.ContainerID, allocation.IPv4, allocation.IPv6, err)
+			staleAllocations = append(staleAllocations, allocation)


If we cleanup resources and then call IPAMD then this check wouldn't be needed..

Right, but we can only do that if the pod was created by v1.12.1+. It would be great to remove this overhead in the future, but in the common case, there should never be any stale allocations to clean up

pkg/ipamd/datastore/data_store.go

cmd/routed-eni-cni-plugin/cni.go

On IPAMD startup, prune IP rules for stale allocations

M00nF1sh

/approve
approving given we'll do the mentioned refactor as follow up.

jdn5126 requested a review from a team as a code owner April 11, 2023 19:43

jdn5126 force-pushed the cni_cleanup branch from 29a7202 to 9c6f655 Compare April 11, 2023 19:43

jdn5126 requested review from jayanthvn, M00nF1sh and achevuru April 11, 2023 19:44

jdn5126 force-pushed the cni_cleanup branch from 9c6f655 to 4a53ec9 Compare April 13, 2023 14:54

orsenthil reviewed Apr 24, 2023

View reviewed changes

pkg/networkutils/network.go Show resolved Hide resolved

orsenthil reviewed Apr 24, 2023

View reviewed changes

pkg/networkutils/network.go Outdated Show resolved Hide resolved

jayanthvn reviewed Apr 24, 2023

View reviewed changes

cmd/routed-eni-cni-plugin/cni.go Show resolved Hide resolved

jayanthvn reviewed Apr 24, 2023

View reviewed changes

pkg/ipamd/datastore/data_store.go Show resolved Hide resolved

jayanthvn reviewed Apr 24, 2023

View reviewed changes

pkg/ipamd/datastore/data_store.go Show resolved Hide resolved

orsenthil reviewed Apr 24, 2023

View reviewed changes

cmd/routed-eni-cni-plugin/cni.go Show resolved Hide resolved

jdn5126 force-pushed the cni_cleanup branch 4 times, most recently from 38f992d to 71df0fd Compare April 26, 2023 15:13

M00nF1sh reviewed May 1, 2023

View reviewed changes

cmd/routed-eni-cni-plugin/cni.go Show resolved Hide resolved

M00nF1sh reviewed May 1, 2023

View reviewed changes

cmd/routed-eni-cni-plugin/cni.go Show resolved Hide resolved

jdn5126 force-pushed the cni_cleanup branch 3 times, most recently from 508364a to 8d448c7 Compare May 3, 2023 22:13

Return success from CNI DEL when IPAMD is unreachable

01f37bf

On IPAMD startup, prune IP rules for stale allocations

jdn5126 force-pushed the cni_cleanup branch from 8d448c7 to 01f37bf Compare May 5, 2023 20:55

M00nF1sh self-requested a review May 8, 2023 21:01

M00nF1sh approved these changes May 8, 2023

View reviewed changes

jdn5126 merged commit 690a9e3 into aws:master May 8, 2023

jdn5126 deleted the cni_cleanup branch May 8, 2023 21:24

jdn5126 mentioned this pull request May 12, 2023

Increasing resource requests leads to loss of cni on nodes #2331

Closed

jdn5126 mentioned this pull request May 22, 2023

aws-cni pod can get stuck in Pending state #2389

Closed

alam0rt mentioned this pull request Aug 4, 2023

IPAMD RPC connection refused - DelNetwork fails #2487

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Return success from CNI DEL when IPAMD is unreachable #2350

Return success from CNI DEL when IPAMD is unreachable #2350

jdn5126 commented Apr 11, 2023

jayanthvn Apr 24, 2023

jdn5126 Apr 24, 2023

M00nF1sh left a comment

Return success from CNI DEL when IPAMD is unreachable #2350

Return success from CNI DEL when IPAMD is unreachable #2350

Conversation

jdn5126 commented Apr 11, 2023

jayanthvn Apr 24, 2023

Choose a reason for hiding this comment

jdn5126 Apr 24, 2023

Choose a reason for hiding this comment

M00nF1sh left a comment

Choose a reason for hiding this comment