Reconciliation can get blocked when Statefulset Controller is stuck trying to create a Pod #7969

Open
d-guo opened this issue Jul 27, 2024 · 1 comment
Labels
>enhancement Enhancement of existing functionality

d-guo commented Jul 27, 2024

Bug Report

This is very similar to the problem reported in #2266, but it surfaces in a way that the fix for that issue does not cover.

It surfaced when I modified the spec to add a toleration that is not allowed. The change propagated to the StatefulSet, which then produced the error pasted below. I have since fixed the spec by removing the toleration, but ECK no longer propagates the change to the StatefulSet.

Events:
  Type     Reason        Age                     From                    Message
  ----     ------        ----                    ----                    -------
  Warning  FailedCreate  6m18s (x32 over 3h59m)  statefulset-controller  create Pod es-test-2 in StatefulSet es-test failed error: admission webhook "{my-auth-service}" denied the request: {my-error-message — related to a toleration i set in the spec that is not allowed}

The problem is that the "dry-run" solution in #3020, which works in most cases, does not help here: the failure is raised by the StatefulSet controller when it tries to create the Pod, rather than by the Pod itself.

Is this an issue the community is interested in? I think a fix here could be to implement the timeout described in #2266 (comment). This would still be nice to have on top of the existing "dry-run" validation.
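
To make the timeout idea a bit more concrete, here is a rough Python sketch of the decision it implies. This is not ECK code and not necessarily what the #2266 comment proposes. `StatefulSetView`, `should_bypass_wait`, and the 30-minute timeout are hypothetical names and values chosen for illustration. The assumption is that the operator can see how many Pods exist and when the StatefulSet controller first reported a `FailedCreate` event.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class StatefulSetView:
    """Hypothetical, simplified view of a StatefulSet as seen by the operator."""
    desired_replicas: int
    current_replicas: int  # Pods that actually exist
    # Time of the first FailedCreate event in the current failure streak,
    # or None if the StatefulSet controller reports no create failures.
    first_failed_create: Optional[datetime]


def should_bypass_wait(sset: StatefulSetView, now: datetime, timeout: timedelta) -> bool:
    """Return True when the operator should stop waiting for the missing Pods
    and push the updated spec to the StatefulSet anyway."""
    if sset.current_replicas >= sset.desired_replicas:
        return False  # nothing is missing, normal reconciliation applies
    if sset.first_failed_create is None:
        return False  # Pods are missing, but no create failure was reported
    # Pods are missing and creation has been failing for at least `timeout`.
    return now - sset.first_failed_create >= timeout


if __name__ == "__main__":
    now = datetime(2024, 7, 27, 12, 0, 0)
    # Mirrors the events above: creation has been failing for 3h59m.
    stuck = StatefulSetView(3, 2, now - timedelta(hours=3, minutes=59))
    print(should_bypass_wait(stuck, now, timedelta(minutes=30)))
```

With a check like this, the operator would keep its current behaviour while Pods are healthy or only briefly failing, and would only force the corrected spec through once the StatefulSet controller has been stuck for longer than the timeout.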

@botelastic botelastic bot added the triage label Jul 27, 2024

d-guo commented Jul 31, 2024

@barkbay @sebgl in case you guys are interested here :)

@pebrc pebrc added the >enhancement Enhancement of existing functionality label Aug 1, 2024
@botelastic botelastic bot removed the triage label Aug 1, 2024