
DrainAndValidate rolling-update hangs if pods won't evict #2537

Closed
blakebarnett opened this issue May 9, 2017 · 33 comments
Labels
area/rolling-update lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale.

Comments

@blakebarnett

I noticed a few instances where, if a pod is hung in ContainerCreating (or some other state) and won't go into the Evicted state, kops hangs forever waiting for it during a rolling-update.

@chrislovecnm
Contributor

What CLI switches did you use?

@blakebarnett
Author

None, just the usual kops rolling-update cluster --yes.

@chrislovecnm
Contributor

So it should have timed out, which is interesting. Did you use the feature flag to turn on drain?

@blakebarnett
Author

Yes. In another instance where it hung, I manually deleted a pod that was stuck in ContainerCreating and it moved on.

@chrislovecnm
Contributor

@foxish any way to have a pod not evict? How do I reproduce this?

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 31, 2017
@foxish

foxish commented Jan 2, 2018

Sorry, I never saw this @chrislovecnm.
I'm not sure this is the wrong behavior - waiting for container creation to complete before trying to evict seems like a safe thing to do.
We don't expect the ContainerCreating state to last that long.

@foxish

foxish commented Jan 2, 2018

Related: kubernetes/kubernetes#48307 (comment)

@chrislovecnm
Contributor

/lifecycle frozen

@k8s-ci-robot k8s-ci-robot added the lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. label Jan 2, 2018
@sidhartanoleto

sidhartanoleto commented Apr 17, 2018

I am having exactly the same behavior on AWS (edited: a similar issue, I'd say).

I have ~10 nodes, and some of them cannot be drained and get stuck in the SchedulingDisabled state. All the pods on those nodes are evicted, but the drain never completes. When I terminate the instance manually, the rolling update continues.

I noticed that the only pods left on those nodes are managed by DaemonSets. Maybe that is somehow related.

Any way to investigate this further?

@Globegitter
Contributor

Globegitter commented Apr 23, 2018

I had a similar issue just now: a normal pod (nginx ingress controller) that was still in the Running state and had been deployed for over 5 days somehow kept the rolling update stuck (no timeout, etc.). It had, however, been restarted 676 times. Unfortunately I did not look at the logs before I manually terminated the pod, so I cannot verify that the restarts were related to the rolling update, but it has now been fixed and the rolling update could move on. If it happens again I will make sure to check the logs for anything suspicious.

Edit: Strange, it just happened again; the nginx controllers could not be evicted. I could not see anything in the logs (some of the pods were not even serving any traffic), and nothing in describe; it seemed they never received the shutdown signal. But again, manually deleting the pods fixed the issue.

Even if the issue is not necessarily fixable, I wonder if it is possible to show more logs, so we don't have to guess that something is up.

@Globegitter
Contributor

Globegitter commented Apr 24, 2018

So strange, this keeps happening now, and it is a subset of pods related to the nginx ingress controller (the internal ingress controller and the default backend for both the internal and public ingress controllers). The interesting thing is that they are all Deployments with 1 pod (whereas the public ingress controller has 2), but the rolling update strategy is set to:

rollingUpdate:
  maxSurge: 1
  maxUnavailable: 1

so I don't think this is related, but I thought it was worth posting.

@michalschott
Contributor

I've noticed similar behaviour; I'm using nginx-ingress deployed with an HPA.

I had to manually kill all nginx-ingress related pods, and additionally I also had to kill the kube-flannel pod on the drained node.

@SharpEdgeMarshall

SharpEdgeMarshall commented Jun 13, 2018

Same issue here; this is the second time kops rolling-update --yes has waited on the drain until I manually killed the nginx-ingress default-backend pod.

@mf-lit

mf-lit commented Jun 22, 2018

I had the same here, also with nginx-ingress, but the issue was revealed by adding verbosity to the rolling update:
kops rolling-update cluster --yes -v 10

I then saw:

I0622 10:20:46.881051   15660 request.go:873] Response Body: {"kind":"Status","apiVersion":"v1","metadata":{},"status":"Failure","message":"Cannot evict pod as it would violate the pod's disruption budget.","reason":"TooManyRequests","details":{"causes":[{"reason":"DisruptionBudget","message":"The disruption budget qa-nginx-ingress-controller needs 1 healthy pods and has 1 currently"}]},"code":429}

So kops was really doing the right thing, just not being very chatty about it. I just had to run more than 1 replica and decrease minAvailable in the disruption budget, and the rolling-update carried on as soon as the additional pod was healthy.
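For anyone hitting the same thing, here is a minimal sketch of the kind of change involved (names and labels are illustrative, not the actual qa-nginx-ingress-controller objects; the PDB API group was policy/v1beta1 on clusters of this era):

apiVersion: policy/v1beta1
kind: PodDisruptionBudget
metadata:
  name: nginx-ingress-controller
spec:
  minAvailable: 1            # with a single replica this can never be satisfied during a drain
  selector:
    matchLabels:
      app: nginx-ingress

With the Deployment scaled to 2 or more replicas, minAvailable: 1 leaves room to evict one pod at a time, so the drain can proceed.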

@olemarkus
Member

It would be very helpful if kops could log when it is waiting for pods with disruption budgets.

@montyz

montyz commented Aug 6, 2018

I agree this is important; I spent some time wondering why my node replacement was stuck. Someone in Slack said it was likely waiting for pods with disruption budgets, and that was the case.

@inodb

inodb commented Aug 15, 2018

@mf-lit how did you change the minAvailable parameter? Is that in the spec of qa-nginx-ingress-controller? I'm experiencing the same issue with this chart: https://github.com/helm/charts/tree/master/stable/nginx-ingress. Maybe I can just send a PR to update the chart.

@mf-lit

mf-lit commented Aug 16, 2018

@inodb That helm chart has what you need:

Either set the replica count with this value:
https://github.com/helm/charts/blob/master/stable/nginx-ingress/templates/controller-deployment.yaml#L13
Or if you want to use HPA, set it with these values:
https://github.com/helm/charts/blob/master/stable/nginx-ingress/templates/controller-hpa.yaml#L18-L19

And then make sure your PDB is set to a value appropriately lower than the Replica count:
https://github.com/helm/charts/blob/master/stable/nginx-ingress/templates/controller-poddisruptionbudget.yaml#L17
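In values.yaml terms that works out to roughly the following (key names as in the chart linked above; double-check them against the chart version you're actually using):

controller:
  replicaCount: 2    # or set the HPA min/max replica values instead
  minAvailable: 1    # rendered into the PDB; must stay below the replica count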

@mf-lit

mf-lit commented Aug 16, 2018

Having thought about this a little more, one of the gotchas of that chart (and many helm charts) is that the ReplicaCount defaults to 1 and the PDB MinAvailable also defaults to 1. This is perfectly reasonable, but it means that it is impossible to evict the pod.

I think (but haven't tested) that it would be better to set minAvailable to 0 when ReplicaCount is 1. I believe that is equivalent to not having a PDB at all, which makes more sense with a single replica.

EDIT: Ah, I see someone has already brought this up:
helm/charts#7127

odavid added a commit to odavid/ansible-role-k8s-aws-cluster that referenced this issue Aug 23, 2018
@thedarkfalcon

thedarkfalcon commented Sep 11, 2018

I'm having a similar issue: running a rolling update always hangs on "instancegroups.go:332] Waiting for 1m30s for pods to stabilize after draining."
This never times out; I have left it for over an hour several times. It seems to happen no matter what the change is: the most recent time was when upgrading the Kubernetes version, but before that it was when I was just adding some kubeAPIServer settings (PodSecurityPolicy). My quick/dirty solution was just to delete the master and nodes in AWS and have the availability set recreate them - with the updated settings.

Edit: I haven't actually seen it in the documentation, but do I have to disable pod availability scaling first?

@FrederikNJS
Contributor

FrederikNJS commented Jan 24, 2019

In our clusters we are running some jobs we don't want interrupted. Some of these jobs can take a full day.

Because we didn't want the jobs interrupted, we resorted to setting up a PodDisruptionBudget with maxUnavailable set to 0. This works quite nicely and keeps the jobs around until they complete.
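For context, a rough sketch of the kind of PDB we use (names and labels here are illustrative; on clusters of this era the API group was policy/v1beta1):

apiVersion: policy/v1beta1
kind: PodDisruptionBudget
metadata:
  name: uninterruptible-jobs
spec:
  maxUnavailable: 0          # never allow voluntary eviction of these pods
  selector:
    matchLabels:
      workload: long-running-job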

The problem arises when I perform a rolling update on the cluster. Whenever kops gets to a node running one of these uninterruptible pods, it just hangs until the pod completes. Sometimes the blocked node is the first node in the rolling update, and in the meantime more uninterruptible pods can easily have been scheduled to additional nodes that need to be rolled.

It would be much nicer if kops would wait for a while (maybe 5 minutes) and, if the drain had not completed by then, skip the node and continue with a different one. Then, once all other nodes had been rolled, kops could come back to the blocked nodes and wait for the pods to complete.

@neolit123
Member

neolit123 commented Aug 4, 2019

Hi, I found this ticket by searching on GitHub.
I think I'm seeing PodDisruptionBudget bugs after enabling one on the coredns Deployment in kubeadm (as an experiment only):

  • the Deployment has 2 replicas.
  • adding a PDB to the Deployment (maxUnavailable: 1) and draining the nodes that host the Deployment pods causes unexpected behavior that is difficult to recover from!

kubernetes/kubeadm#1672 (comment)

@johngmyers
Member

/area rolling-update

@sstarcher
Contributor

It would be useful to have a flag that ignores PDBs after a certain amount of time. This will trip up anyone who has a PDB minAvailable of 1 with a replica count of 1: it will just sit for a long time waiting for something that will never come.

@johngmyers
Member

@sstarcher I believe that would be a separate feature request. I happen to think such a thing would be quite dangerous, but it would be useful when the cluster operators and the workload developers have an adversarial relationship.

I believe this particular ticket should be closed as "that's the intended behavior".

@blakebarnett
Author

@johngmyers originally this ticket had nothing to do with PDBs. Containers can get stuck in ContainerCreating and prevent evictions, breaking a node drain (just one of many possible failure scenarios). PDBs preventing a drain are valid, and I agree that kops shouldn't ignore them. Nodes getting into a bad state where they can't be drained is a more general Kubernetes operational problem, and maybe kops shouldn't do anything about that either.

But it might be good to document and/or add output that explains why the timeout occurred. Kops does validation elsewhere and explains why it fails; this would be just another flavor.

@johngmyers
Member

It might be good to change the title of this issue to limit its scope to ContainerCreating.

I believe kops's logging of hung drains is better now.

@johngmyers
Member

@blakebarnett Would you have a procedure for getting a pod hung in ContainerCreating?

I tend to agree that a stuck ContainerCreating pod blocking eviction seems like a problem with the Kubernetes eviction implementation.

@blakebarnett
Author

We've seen it usually when there has been resource contention on a node and something puts the node into an unrecoverable state. It happens for quite a few different reasons, but I believe it's usually because of the contention and the bad behavior of most apps in that scenario. It's been hard to reproduce intentionally, though. When the oom-killer kicks in at the system level and happens to pick dockerd as the process to snipe, that definitely seems to be problematic.

We've seen it with the unregister_netdevice kernel issue, high CPU contention, file-locking contention (currently trying to nail this down), and NIC driver resets (ENAs on AWS c5/m5 instances with older kernels are very problematic).

@johngmyers
Member

johngmyers commented Nov 15, 2019

Could you file a Kubernetes issue? I believe pods in ContainerCreating state should not block voluntary eviction, be they stuck or not. It's not as if they have state that needs grace to terminate.

Or is it the "wait for pods to terminate" phase they're blocking, not voluntary eviction?

@olemarkus
Member

kops now has --drain-timeout, which should prevent rolls from hanging. There is also more logging of why kops hangs on a drain.
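For example (the timeout value here is illustrative; see kops rolling-update cluster --help for the exact usage):

kops rolling-update cluster --yes --drain-timeout 15m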

/close

@k8s-ci-robot
Contributor

@olemarkus: Closing this issue.

In response to this:

kops now has --drain-timeout, which should prevent rolls from hanging. There is also more logging of why kops hangs on a drain.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
