Remove scheduler `wait`s to speed up recovery time #8200
Conversation
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: pierDipi. The full list of commands accepted by this bot can be found here. The pull request process is described here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing `/approve` in a comment.
Codecov Report
Attention: Patch coverage is …
Additional details and impacted files:

@@            Coverage Diff             @@
##             main    #8200      +/-   ##
==========================================
- Coverage   67.47%   66.56%   -0.91%
==========================================
  Files         371      371
  Lines       18036    18271     +235
==========================================
- Hits        12169    12162       -7
- Misses       5088     5324     +236
- Partials      779      785       +6

☔ View full report in Codecov by Sentry.
Force-pushed from 0a65cf8 to a64478f.
Currently, the scheduler and autoscaler are single-threaded and use a lock to prevent multiple scheduling and autoscaling decisions from happening in parallel. This is not a problem for our use cases; however, the multiple `wait`s currently present slow down recovery time. From my testing, if I delete and recreate the Kafka control plane and data plane, without this patch it takes 1 hour to recover when there are 400 triggers, or 20 minutes when there are 100 triggers; with the patch recovery is nearly immediate (only 2-3 minutes with 400 triggers).
- Remove `wait`s from state builder and autoscaler
- Add additional debug logs
- Use the logger provided through the context, as opposed to a global logger in each individual component, to preserve `knative/pkg` resource-aware log keys (see the sketch below).
Signed-off-by: Pierangelo Di Pilato <pierdipi@redhat.com>
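For illustration only (this is not the PR's actual diff, and the function name `evictPod` is hypothetical), the logging change amounts to pulling the logger out of the context via `knative.dev/pkg/logging` instead of holding a package-level logger, so log lines keep the resource-aware keys that `knative/pkg` attaches to the context:

```go
package scheduler

import (
	"context"

	"knative.dev/pkg/logging"
)

// evictPod is a hypothetical component function. Instead of a package-level
// (global) logger, it takes the logger from the context, which already
// carries the knative/pkg resource-aware log keys (e.g. the key of the
// resource being reconciled).
func evictPod(ctx context.Context, podName string) error {
	logger := logging.FromContext(ctx)

	// Extra debug logging along the hot path helps diagnose slow recovery.
	logger.Debugw("evicting pod", "pod", podName)

	// ... eviction logic elided ...
	return nil
}
```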
Force-pushed from a64478f to 03fd6ae.
/test unit-tests
/lgtm
I like the extra added logging here as well
/test reconciler-tests
/cherry-pick release-1.15
@pierDipi: once the present PR merges, I will cherry-pick it on top of release-1.15 in a new PR and assign it to you. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.
/cherry-pick release-1.14
@pierDipi: once the present PR merges, I will cherry-pick it on top of release-1.14 in a new PR and assign it to you. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.
@pierDipi: #8200 failed to apply on top of branch "release-1.14":
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.
@pierDipi: #8200 failed to apply on top of branch "release-1.15":
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.
Commits referencing this pull request:
- "…its to speed up recovery time" (#8202): combines Improve scheduler memory usage (#8144) — create a namespace-scoped statefulset lister instead of a cluster-wide one, accept a PodLister rather than creating a cluster-wide one (see the sketch below), and update codegen — with Remove scheduler `wait`s to speed up recovery time (#8200). Signed-off-by: Pierangelo Di Pilato <pierdipi@redhat.com>
- "…its to speed up recovery time" (#8203): the same combination of #8144 and #8200. Signed-off-by: Pierangelo Di Pilato <pierdipi@redhat.com>
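A minimal sketch, assuming `k8s.io/client-go` informer factories, of the namespace-scoped versus cluster-wide lister distinction referenced by #8144 (the function name and wiring are illustrative, not the exact code in that PR):

```go
package scheduler

import (
	"time"

	"k8s.io/client-go/informers"
	"k8s.io/client-go/kubernetes"
	appslisters "k8s.io/client-go/listers/apps/v1"
)

// newNamespacedStatefulSetLister builds an informer factory that watches only
// the given namespace, so the backing cache holds StatefulSets from that
// namespace rather than from the whole cluster, reducing memory usage.
// The caller is expected to Start() the returned factory and wait for cache
// sync before using the lister.
func newNamespacedStatefulSetLister(client kubernetes.Interface, namespace string) (appslisters.StatefulSetLister, informers.SharedInformerFactory) {
	factory := informers.NewSharedInformerFactoryWithOptions(
		client,
		10*time.Hour,                       // resync period (illustrative)
		informers.WithNamespace(namespace), // namespace-scoped instead of cluster-wide
	)
	return factory.Apps().V1().StatefulSets().Lister(), factory
}
```

Accepting an already-constructed PodLister as a parameter follows the same idea: the caller decides the scope of the underlying cache instead of each component building its own cluster-wide one.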
Currently, the scheduler and autoscaler are single-threaded and use a lock to prevent multiple scheduling and autoscaling decisions from happening in parallel. This is not a problem for our use cases; however, the multiple `wait`s currently present slow down recovery time. From my testing, if I delete and recreate the Kafka control plane and data plane (which roughly simulates an upgrade), without this patch it takes hours to recover when there are 400 triggers, or 20 minutes when there are 100 triggers; with the patch recovery is nearly immediate (only 2-3 minutes with 400 triggers). A sketch of the kind of blocking wait involved follows the screenshots below.
- Remove `wait`s from state builder and autoscaler
- Add additional debug logs
- Use the logger provided through the context, as opposed to a global logger in each individual component, to preserve `knative/pkg` resource-aware log keys.
Before with 200 triggers: [screenshot]
Before with 1024 triggers, we see a very high work queue depth for 3 hours (knative controller queue size): [screenshot]
After with various numbers of triggers: [screenshot]
Work queue depth is not high for long periods: [screenshot]
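To make the recovery-time claim concrete, here is a sketch of the pattern being removed (the helper names and the readiness condition are hypothetical, not the PR's actual code): a blocking poll inside a single-threaded scheduling pass versus a non-blocking read that lets the next pass pick up the not-yet-ready state.

```go
package scheduler

import (
	"context"
	"time"

	"k8s.io/apimachinery/pkg/util/wait"
)

// Before: the scheduling/autoscaling pass blocks, polling until a condition
// (for example, pod readiness) holds. Because the scheduler and autoscaler
// are single-threaded and serialized by a lock, these waits add up and
// recovery with hundreds of triggers takes a very long time.
func blockUntilReady(ctx context.Context, isReady func(context.Context) (bool, error)) error {
	return wait.PollUntilContextTimeout(ctx, time.Second, 5*time.Minute, true, isReady)
}

// After: read the current state once and return immediately. If the condition
// does not hold yet, the next reconcile/autoscaler pass observes it, so no
// pass holds the lock while waiting on another component.
func readyNow(ctx context.Context, isReady func(context.Context) (bool, error)) (bool, error) {
	return isReady(ctx)
}
```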