-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
🌱 Make ClusterResourceSet controller more predictable #10869
🌱 Make ClusterResourceSet controller more predictable #10869
Conversation
Together with @sbueringer we did investingate this a little bit more. There are a few things at play:
We should wait for @sbueringer PR to merge before checking again if this fix the issue |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just a few nits, otherwise lgtm
exp/addons/internal/controllers/clusterresourceset_controller.go
Outdated
Show resolved
Hide resolved
exp/addons/internal/controllers/clusterresourceset_controller.go
Outdated
Show resolved
Hide resolved
exp/addons/internal/controllers/clusterresourceset_controller.go
Outdated
Show resolved
Hide resolved
exp/addons/internal/controllers/clusterresourceset_controller.go
Outdated
Show resolved
Hide resolved
The separate PR mentioned above is open now (#10880) |
/hold for #10880 |
Thx! /lgtm (keeping the hold for the other PR and thus also until after the beta tag is created) |
LGTM label has been added. Git tree hash: d5f1201126700ccde19cea8527f7cac1a67a8734
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: sbueringer The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
@fabriziopandini my PR is merged, you have to rebase if you want to run the test locally though |
/test pull-cluster-api-test-main |
1 similar comment
/test pull-cluster-api-test-main |
https://prow.k8s.io/view/gs/kubernetes-jenkins/pr-logs/pull/kubernetes-sigs_cluster-api/10869/pull-cluster-api-test-main/1813476101129768960 is a new flake independent of your PR (flaky on main, fails 100% on test-mink8s) |
With the qps changes one run took 6s and another 7s. That is roughly what we expected (and far away from the 1m timeout) |
Thank you so much @fabriziopandini and @sbueringer for looking at this! Sorry for introducing this test flakiness while fixing the previous bug 😞 |
/lgtm |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
/hold cancel Let's keep an eye on the test. I think it should not be flaky anymore (even with the current timeout) |
Can we backport this to 1.7 branch too please? |
/cherrypick release-1.7 |
@jimmidyson: #10869 failed to apply on top of branch "release-1.7":
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
Yup, that's fine. Just needs a manual cherry-pick. Maybe you can check for other PRs we had in this area and that should probably also be cherry-picked (I think there were only 1-2 since your PR merged) |
@sbueringer Looks like we'd need to cherry-pick ##10756, #10880, and this one. Does that sound right to you? |
…usterresourceset-more-predictable 🌱 Make ClusterResourceSet controller more predictable
Sounds reasonable! |
What this PR does / why we need it:
ClusterResource set controller doesn't have an ideal implementation for cases where many ClusterResourceSets are targeting the same cluster (might be also the API is not specifically designed for this use case).
This PR is making this case a little bit more predictable, by avoiding a potential impact of exponential backoff delay quickly growing, but it is not solving the underlying issue.
At least, this should help in getting rid of some flakiness in our tests, and there is also a comment providing some more context on what's going on.
Another positive effect of this PR is that it remove a lot of noise from the logs
/area clusterresourceset
Related to #10854