-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
KEP-1972: kubelet exec probe timeouts #1973
KEP-1972: kubelet exec probe timeouts #1973
Conversation
/assign @SergeyKanzhelev For initial review |
58ef3bc
to
95a36f6
Compare
the timeout value is supported in the Container Probe API. Because exec probe timeouts | ||
were never respected by kubelet, a new feature gate `ExecProbeTimeouts` will be introduced | ||
so users have an easy way to revert back if the newly introduced probe timeout results | ||
in unexpected behavior. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
small nit: the feature gate is for ops. For developers addressing the issue would be an increase of a timeout. So "users" and the action these users do in this context can be clarified.
Perhaps:
the timeout value is supported in the Container Probe API. Because exec probe timeouts | |
were never respected by kubelet, a new feature gate `ExecProbeTimeouts` will be introduced | |
so users have an easy way to revert back if the newly introduced probe timeout results | |
in unexpected behavior. | |
the timeout value is supported in the Container Probe API. Because exec probe timeouts | |
were never respected by kubelet, a new feature gate `ExecProbeTimeouts` will be introduced. | |
With this feature nodes can be configured to preserve the current behavior while the proper | |
timeouts are configured for the probes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Updated
|
||
### Goals | ||
|
||
* fix exec probe timeouts in kubelet |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
perhaps "treat exec probe timeouts as probe failures in kubelet"
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
95a36f6
to
0606728
Compare
Signed-off-by: Andrew Sy Kim <kim.andrewsy@gmail.com>
0606728
to
40670f6
Compare
/lgtm |
@andrewsykim are we missing some discussion on TODO(s) for different runtimes / dockershim? |
@dims I don't think so, I think the two TODOs that came out from this discussion were:
Both of those I imagine will be addressed in separate KEPs, this is one is to only fix exec probe timeouts |
/assign @dchen1107 @derekwaynecarr |
@andrewsykim sounds good. thanks! |
Another suggestion: should we increase scope of the KEP and also respect timeout in preStop exec hook? If so, the name of the flag may need to change to |
I agree that we should fix this, but I think grace termination period and probe timeouts are separate concerns here. I know some of the efforts around side car containers were also trying to address this so I wouldn't want to overlap there too much. I might be missing context on this one though. |
thinking more about it - I'd say even adding logic to respect the gracePeriodOverride would likely not be a breaking change so no flag is needed. So this KEP should be gtg |
### Test Plan | ||
|
||
E2E tests: | ||
* re-enable [existing exec liveness probe e2e test](https://github.com/kubernetes/kubernetes/blob/ea1458550077bdf3b26ac34551a3591d280fe1f5/test/e2e/common/container_probe.go#L210-L227) that is currently being skipped |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
wow :/
thanks for the write-up. agree with approach. /approve |
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: andrewsykim, dchen1107, derekwaynecarr The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
Signed-off-by: Andrew Sy Kim kim.andrewsy@gmail.com
Kubelet today does not respect exec probe timeouts. This is an unexpected behavior
since the timeout value is supported in the Container Probe API. Because exec probe timeouts
were never respected by kubelet, a new feature gate
ExecProbeTimeouts
will be introducedso users have an easy way to revert back if the newly introduced probe timeout results
in unexpected behavior.
KEP issue: #1972