Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Disable HonorCooldown for AWS desired capacity changes #151

Closed
awprice opened this issue Feb 27, 2019 · 2 comments
Closed

Disable HonorCooldown for AWS desired capacity changes #151

awprice opened this issue Feb 27, 2019 · 2 comments
Labels
AWS bug Something isn't working Cloud Provider

Comments

@awprice
Copy link
Member

awprice commented Feb 27, 2019

We should set the HonorCooldown option to false when setting the desired capacity for the ASG.

We've seen cases where an instance is stuck in a Pending state and will block the ASG from being updated. This can occur for 30-40 minutes until the instance is terminated by AWS because it is failing health checks. The instance is usually in this state due to an underlying hardware issue.

Changing this value to false will allow Escalator to continue operating even when there are nodes with issues in the ASG.

@awprice awprice added bug Something isn't working AWS Cloud Provider labels Feb 27, 2019
@Jacobious52
Copy link
Member

Are there any downsides or side effects to settings this to false? If so, would having it as a defaulted option in the node group config make sense?

@awprice
Copy link
Member Author

awprice commented Feb 28, 2019

I don't think there are any downsides to setting this to false, as we already provide a safeguard in Escalator with the scale lock. The scale lock works the same way as the cooldown in Autoscaling groups in that it prevents runaway scaling.

Cluster-autoscaler also sets this to false and doesn't have an option to change it - https://github.com/kubernetes/autoscaler/blob/master/cluster-autoscaler/cloudprovider/aws/auto_scaling_groups.go#L201

To include it in the node group config would require some extra thought - this is an AWS specific setting, we will need a way to store per-cloudprovider settings in the node group config.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
AWS bug Something isn't working Cloud Provider
Projects
None yet
Development

No branches or pull requests

2 participants