Alert when nodes fail to join cluster #3828

Open
yulianedyalkova opened this issue Jan 13, 2025 · 0 comments
Labels
team/tenet Team Tenet

Comments

@yulianedyalkova

Related to https://github.com/giantswarm/adidas/issues/1325#issuecomment-2554383452

TL;DR: both Karpenter and cluster-autoscaler were trying to scale up, but new nodes couldn't join the cluster because of issues with the bootstrap token (link). As a result, the failing nodes were continuously terminated and recreated.

We should be able to detect this situation as early as possible instead of relying on unrelated alerts to catch the symptoms.
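
As a rough illustration of the condition such an alert could check, here is a minimal sketch. It assumes we can somehow obtain, per launched instance, its launch time, the time it registered as a node (if ever), and its termination time. The `Instance` type, the 10-minute grace period, and the threshold of 3 are all hypothetical and not taken from any existing tooling; the real data source (cloud provider API, Karpenter or cluster-autoscaler metrics, Kubernetes events) is left open.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List, Optional


@dataclass
class Instance:
    """Hypothetical record of one machine launched by an autoscaler."""
    instance_id: str
    launched_at: datetime
    joined_at: Optional[datetime] = None      # None: never registered as a node
    terminated_at: Optional[datetime] = None  # None: still running


def failed_to_join(
    instances: List[Instance],
    now: datetime,
    grace: timedelta = timedelta(minutes=10),  # assumed value, tune per cluster
) -> List[Instance]:
    """Return instances that never joined and were either terminated
    or have been running longer than the grace period."""
    failed = []
    for inst in instances:
        if inst.joined_at is not None:
            continue  # joined successfully, nothing to report
        deadline = inst.launched_at + grace
        if inst.terminated_at is not None or now >= deadline:
            failed.append(inst)
    return failed


def should_alert(
    instances: List[Instance],
    now: datetime,
    grace: timedelta = timedelta(minutes=10),
    threshold: int = 3,  # assumed value: tolerate a few one-off failures
) -> bool:
    """Fire once enough instances have failed to join the cluster."""
    return len(failed_to_join(instances, now, grace)) >= threshold
```

Counting terminated-without-joining instances matters here because, in the incident described above, failing nodes were recreated continuously, so a check that only looks at currently running instances could miss the loop.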

@yulianedyalkova yulianedyalkova converted this from a draft issue Jan 13, 2025
@yulianedyalkova yulianedyalkova added the team/tenet Team Tenet label Jan 13, 2025
@yulianedyalkova yulianedyalkova moved this from Inbox 📥 to Up Next ➡️ in Roadmap Jan 14, 2025