We should set the `HonorCooldown` option to `false` when setting the desired capacity for the ASG.
We've seen cases where an instance gets stuck in a Pending state and blocks the ASG from being updated. This can last 30-40 minutes, until AWS terminates the instance for failing health checks. The instance is usually in this state because of an underlying hardware issue.
Changing this value to `false` will allow Escalator to continue operating even when there are nodes with issues in the ASG.
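A rough sketch of what the change could look like. To keep it dependency-free, `setDesiredCapacityInput` is a local stand-in that mirrors the `AutoScalingGroupName`, `DesiredCapacity`, and `HonorCooldown` fields of `autoscaling.SetDesiredCapacityInput` in the AWS SDK for Go; `capacitySetter`, `setASGSize`, and `recordingClient` are hypothetical names for illustration, not Escalator's actual code.

```go
package main

// setDesiredCapacityInput is a local stand-in (assumption) mirroring the
// fields of autoscaling.SetDesiredCapacityInput that matter for this issue.
type setDesiredCapacityInput struct {
	AutoScalingGroupName *string
	DesiredCapacity      *int64
	HonorCooldown        *bool
}

// capacitySetter is a hypothetical interface for the client that sends
// the request; real code would use the SDK's autoscaling client.
type capacitySetter interface {
	SetDesiredCapacity(in *setDesiredCapacityInput) error
}

// setASGSize requests a new desired capacity and explicitly sets
// HonorCooldown to false, so the request does not wait on the ASG cooldown.
func setASGSize(svc capacitySetter, asgName string, size int64) error {
	honorCooldown := false
	return svc.SetDesiredCapacity(&setDesiredCapacityInput{
		AutoScalingGroupName: &asgName,
		DesiredCapacity:      &size,
		HonorCooldown:        &honorCooldown,
	})
}

// recordingClient is a fake client that remembers the last request,
// which is handy for checking what would be sent.
type recordingClient struct {
	last *setDesiredCapacityInput
}

func (r *recordingClient) SetDesiredCapacity(in *setDesiredCapacityInput) error {
	r.last = in
	return nil
}
```

Setting the field explicitly on every call, rather than leaving it unset, keeps the intent visible at the call site.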
I don't think there are any downsides to setting this to false, as we already provide a safeguard in Escalator with the scale lock. The scale lock works the same way as the cooldown in Auto Scaling groups, in that it prevents runaway scaling.
Including it in the node group config would require some extra thought: this is an AWS-specific setting, so we would need a way to store per-cloud-provider settings in the node group config.
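One possible shape for per-provider settings, purely as a sketch: an optional provider-specific block nested inside the node group options, with the default falling back to `false` when the block or field is absent. All names here (`nodeGroupOptions`, `awsOptions`, the `aws` and `honor_cooldown` keys, `parseNodeGroup`) are hypothetical, and JSON is used only to keep the example stdlib-only; the real node group config format and field names would differ.

```go
package main

import "encoding/json"

// awsOptions holds settings that only make sense for the AWS provider.
// A pointer distinguishes "not set" from an explicit false.
type awsOptions struct {
	HonorCooldown *bool `json:"honor_cooldown,omitempty"`
}

// nodeGroupOptions is a hypothetical node group config with an optional
// provider-specific block; other providers could add sibling fields.
type nodeGroupOptions struct {
	Name string      `json:"name"`
	AWS  *awsOptions `json:"aws,omitempty"`
}

// parseNodeGroup decodes a single node group from JSON.
func parseNodeGroup(data []byte) (nodeGroupOptions, error) {
	var ng nodeGroupOptions
	err := json.Unmarshal(data, &ng)
	return ng, err
}

// honorCooldown returns the configured value, defaulting to false when
// the AWS block or the field itself is missing.
func (ng nodeGroupOptions) honorCooldown() bool {
	if ng.AWS == nil || ng.AWS.HonorCooldown == nil {
		return false
	}
	return *ng.AWS.HonorCooldown
}
```

With this layout, a config such as `{"name":"default"}` resolves to `false`, while `{"name":"default","aws":{"honor_cooldown":true}}` opts back in to the cooldown for that node group.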