HPA scaling while in scale down delay window causes perpetual "progressing" state on rollout #3848
Closed
2 tasks done
Labels
bug
Something isn't working
Checklist:
Describe the bug
I’ve noticed a bug in the
Rollout
behaviour when these specific conditions met:It seems like the
HPA
only scales the stable replica set and not the old revision. When a new canary rollout is triggered, this causes it to be in a perpetualProgressing
state with the message"more replicas need to be updated"
. Looking at the code, it seems to be because theUpdatedReplicas
doesn’t match thespec.replicas
To Reproduce
Rollout
with a scale down delay and an attachedHorizontalPodAutoscaler
:scaleDownDelaySeconds
windowHorizontalPodAutoscaler
. For example, change theminReplicas
to 2. Notice the HPA only affects the latest revision of theReplicaSet
and not the previousRollout
is in perpetualProgressing
state with the message"more replicas need to be updated"
Expected behavior
I think the
HorizontalPodAutoscaler
should scale allReplicaSets
so they are in sync and if ever a rollback if performed, the previous revision will be able to handle the load. If that's not possible then it should at least not block the rollout progression.Screenshots
Version
v1.7.2+59e5bd3
Logs
None for the moment. Will try to reproduce and post them.
Message from the maintainers:
Impacted by this bug? Give it a 👍. We prioritize the issues with the most 👍.
The text was updated successfully, but these errors were encountered: