Change tryAssignIPs to assign up to configured WARM_IP_TARGET #1279

jacksontj · 2020-11-05T17:58:48Z

What type of PR is this?

bug / feature

Which issue does this PR fix:
#1272

What does this PR do / Why do we need it:
Previously tryAssignIPs would always allocate all available IPs on the
ENI regardless of the number of IPs the node would want. A great
showcase of this behavior is on node bootstrap. Assuming we have a node
with maxIPsPerENI=30 and WARM_IP_TARGET=5, the following happens:

Node starts with single IP
tryAssignIPs allocates an additional 29 IPs
decreaseIPPool will release 29-5=24 IPs

After the change this would look like:

Node starts with a single IP
tryAssignIPs allocates an additional 5 IPs

So in this very common flow we cut out an entire AWS API call per node.
In addition this behavior exhibits itself any time we need to scale up
IPs as tryAssignIPs always attempts to over-allocate IPs -- which then
just need to be released almost immediately.

If an issue # is not available please add repro steps and logs from IPAMD/CNI showing the issue:

Testing done on this change:

Automation added to e2e:

Will this break upgrades or downgrades. Has updating a running cluster been tested?:

Does this change require updates to the CNI daemonset config files to work?:

Does this PR introduce any user-facing change?:

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

Previously tryAssignIPs would always allocate all available IPs on the ENI regardless of the number of IPs the node would want. A great showcase of this behavior is on node bootstrap. Assuming we have a node with maxIPsPerENI=30 and WARM_IP_TARGET=5, the following happens: - Node starts with single IP - tryAssignIPs allocates an additional 29 IPs - decreaseIPPool will release 29-5=24 IPs After the change this would look like: - Node starts with a single IP - tryAssignIPs allocates an additional 5 IPs So in this very common flow we cut out an entire AWS API call per node. In addition this behavior exhibits itself any time we need to scale up IPs as tryAssignIPs always attempts to over-allocate IPs -- which then just need to be released almost immediately. Fixes aws#1272

jacksontj · 2020-11-05T17:59:04Z

cc @mogren

jayanthvn · 2020-11-06T18:37:33Z

Hi @jacksontj

Please kindly fill the PR template and attach unit test cases - logs showing the functionality is working should be sufficient.

Appreciate your help on this. Thank you!

nithu0115

(lgtm) fwiw, we need to test if setting WARM_IP_TARGET as 5 and if I have 30 pending pods to be scheduled on a newly created node, and see if p99 of pod start time increases in this case and/or see if there are other any negative impacts.

nithu0115

(lgtm) fwiw, we need to test if setting WARM_IP_TARGET as 5 and if I have 30 pending pods to be scheduled on a newly created node, and see if p99 of pod start time increases in this case and/or see if there are other any negative impacts.

jayanthvn · 2020-11-11T18:44:59Z

Also, can you please add mock up unit test cases something similar to ones here - https://github.com/aws/amazon-vpc-cni-k8s/blob/master/pkg/awsutils/awsutils_test.go. Thank you!

jayanthvn · 2021-01-21T21:20:43Z

As @nithu0115 mentioned, we will test this change to verify if p99 pod start up time is increasing.

haouc · 2021-03-02T01:02:12Z

pkg/ipamd/ipamd.go

 	// Find an ENI where we can add more IPs
 	eni := c.dataStore.GetENINeedsIP(c.maxIPsPerENI, c.useCustomNetworking)
 	if eni != nil && len(eni.IPv4Addresses) < c.maxIPsPerENI {
 		currentNumberOfAllocatedIPs := len(eni.IPv4Addresses)
 		// Try to allocate all available IPs for this ENI
-		err = c.awsClient.AllocIPAddresses(eni.ID, c.maxIPsPerENI-currentNumberOfAllocatedIPs)
+		err = c.awsClient.AllocIPAddresses(eni.ID, int(math.Min(float64(c.maxIPsPerENI-currentNumberOfAllocatedIPs), float64(toAllocate))))


We have min() and max() defined in this file which can be used here without doing type conversion.

M00nF1sh

/lgtm

nithu0115 · 2021-06-03T16:50:22Z

I cherry picked this PR on top of 1.7.10 release and tested it. The change is looks good to me.

Tested it with below 2 scenarios and saw expected behavior. One thing to note was, I saw an increase in number of AssignPrivateIpAddresses API calls as we are not going optimistic behavior anymore where we previously assumed that there were pending Pods.

Scenario1: Started with 1 node in the cluster and CNI configuration MINIMUM_IP_TARGET=5 and WARM_IP_TARGET=2 and launched 100 Pods.
Scenario2: WARM_IP_TARGET=2 and launched 100 Pods.

Previously tryAssignIPs would always allocate all available IPs on the ENI regardless of the number of IPs the node would want. A great showcase of this behavior is on node bootstrap. Assuming we have a node with maxIPsPerENI=30 and WARM_IP_TARGET=5, the following happens: - Node starts with single IP - tryAssignIPs allocates an additional 29 IPs - decreaseIPPool will release 29-5=24 IPs After the change this would look like: - Node starts with a single IP - tryAssignIPs allocates an additional 5 IPs So in this very common flow we cut out an entire AWS API call per node. In addition this behavior exhibits itself any time we need to scale up IPs as tryAssignIPs always attempts to over-allocate IPs -- which then just need to be released almost immediately. Fixes aws#1272

…#1495) Previously tryAssignIPs would always allocate all available IPs on the ENI regardless of the number of IPs the node would want. A great showcase of this behavior is on node bootstrap. Assuming we have a node with maxIPsPerENI=30 and WARM_IP_TARGET=5, the following happens: - Node starts with single IP - tryAssignIPs allocates an additional 29 IPs - decreaseIPPool will release 29-5=24 IPs After the change this would look like: - Node starts with a single IP - tryAssignIPs allocates an additional 5 IPs So in this very common flow we cut out an entire AWS API call per node. In addition this behavior exhibits itself any time we need to scale up IPs as tryAssignIPs always attempts to over-allocate IPs -- which then just need to be released almost immediately. Fixes #1272 Co-authored-by: Thomas Jackson <jacksontj.89@gmail.com>

nithu0115 reviewed Nov 9, 2020

View reviewed changes

jayanthvn assigned nithu0115 Jan 6, 2021

jayanthvn requested a review from haouc March 1, 2021 22:59

haouc reviewed Mar 2, 2021

View reviewed changes

M00nF1sh approved these changes Jun 3, 2021

View reviewed changes

M00nF1sh merged commit 078bcb7 into aws:master Jun 3, 2021

jayanthvn added this to the v1.8.0 milestone Jun 3, 2021

This was referenced Jun 10, 2021

🥳 aws-vpc-cni v1.8.0 Automated Release! 🥑 aws/eks-charts#536

Closed

🥳 aws-vpc-cni v1.8.0 Automated Release! 🥑 aws/eks-charts#538

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change tryAssignIPs to assign up to configured WARM_IP_TARGET #1279

Change tryAssignIPs to assign up to configured WARM_IP_TARGET #1279

jacksontj commented Nov 5, 2020

jacksontj commented Nov 5, 2020

jayanthvn commented Nov 6, 2020

nithu0115 left a comment

nithu0115 left a comment

jayanthvn commented Nov 11, 2020

jayanthvn commented Jan 21, 2021

haouc Mar 2, 2021 •

edited

Loading

M00nF1sh left a comment

nithu0115 commented Jun 3, 2021

Change tryAssignIPs to assign up to configured WARM_IP_TARGET #1279

Change tryAssignIPs to assign up to configured WARM_IP_TARGET #1279

Conversation

jacksontj commented Nov 5, 2020

jacksontj commented Nov 5, 2020

jayanthvn commented Nov 6, 2020

nithu0115 left a comment

Choose a reason for hiding this comment

nithu0115 left a comment

Choose a reason for hiding this comment

jayanthvn commented Nov 11, 2020

jayanthvn commented Jan 21, 2021

haouc Mar 2, 2021 • edited Loading

Choose a reason for hiding this comment

M00nF1sh left a comment

Choose a reason for hiding this comment

nithu0115 commented Jun 3, 2021

haouc Mar 2, 2021 •

edited

Loading