
[Fleet] Change default batch size #161249

Merged
merged 1 commit into from
Jul 5, 2023

Conversation

nchaulet
Member

@nchaulet nchaulet commented Jul 5, 2023

Summary

Resolve #158361

Change default batch size for schema version upgrade from 100 concurrent policies to 2 to reduce memory usage.

When a schema change requires updating every agent policy, the current code updated 100 policies concurrently, which could cause memory issues on small Kibana instances. Since we recommend at most 500 agent policies, and the upgrade is triggered asynchronously during Kibana start, it should still complete in a reasonable amount of time with the smaller batch size.
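The batching described above can be sketched as follows. This is an illustrative sketch, not the actual Fleet code: the function name and `AgentPolicy` shape are hypothetical, but it shows why capping concurrency at 2 bounds peak memory compared to 100 in-flight updates.

```typescript
type AgentPolicy = { id: string };

// Hypothetical sketch: process policies in fixed-size batches so that at most
// `batchSize` updates are in flight at once.
async function upgradePoliciesInBatches(
  policies: AgentPolicy[],
  upgradeOne: (p: AgentPolicy) => Promise<void>,
  batchSize = 2 // this PR lowers the default from 100 to 2
): Promise<void> {
  for (let i = 0; i < policies.length; i += batchSize) {
    const batch = policies.slice(i, i + batchSize);
    // Only `batchSize` policies are loaded and updated concurrently,
    // bounding peak memory on small Kibana instances.
    await Promise.all(batch.map(upgradeOne));
  }
}
```

With ~500 policies at most, a batch size of 2 still finishes in a few hundred sequential rounds, which is acceptable for an asynchronous startup task.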

Alternative solution

Instead of changing the default, we could configure this via the stack pack only for small Kibana instances.

@nchaulet nchaulet added the Team:Fleet Team label for Observability Data Collection Fleet team label Jul 5, 2023
@nchaulet nchaulet self-assigned this Jul 5, 2023
@nchaulet nchaulet requested a review from a team as a code owner July 5, 2023 11:43
@elasticmachine
Contributor

Pinging @elastic/fleet (Team:Fleet)

@nchaulet nchaulet added the release_note:skip Skip the PR/issue when compiling release notes label Jul 5, 2023
@apmmachine
Contributor

🤖 GitHub comments


Just comment with:

  • /oblt-deploy : Deploy a Kibana instance using the Observability test environments.
  • run elasticsearch-ci/docs : Re-trigger the docs validation. (use unformatted text in the comment!)

@jlind23
Contributor

jlind23 commented Jul 5, 2023

@nchaulet do you think that in the long term we could have a batch size inferred from the available memory?

@kibana-ci
Collaborator

💚 Build Succeeded

Metrics [docs]

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

| id | before | after | diff |
| --- | --- | --- | --- |
| fleet | 975.2KB | 975.4KB | +215.0B |
Unknown metric groups

ESLint disabled line counts

| id | before | after | diff |
| --- | --- | --- | --- |
| enterpriseSearch | 14 | 16 | +2 |
| securitySolution | 410 | 414 | +4 |
| total | | | +6 |

Total ESLint disabled count

| id | before | after | diff |
| --- | --- | --- | --- |
| enterpriseSearch | 15 | 17 | +2 |
| securitySolution | 489 | 493 | +4 |
| total | | | +6 |

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

cc @nchaulet

@nchaulet
Member Author

nchaulet commented Jul 5, 2023

@nchaulet do you think that in the long term we could have a batch size inferred from the available memory?

Yes, it's probably something we can improve in the long term.
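One way such an improvement might look, purely as a sketch of the idea discussed here (the per-policy memory figure and the clamping bounds are illustrative assumptions, not measured values or actual Fleet code):

```typescript
import * as os from 'os';

// Illustrative assumption: rough peak memory needed per in-flight policy update.
const ESTIMATED_BYTES_PER_POLICY = 20 * 1024 * 1024; // 20 MB, hypothetical

// Hypothetical helper: derive a batch size from free memory instead of
// hard-coding it, clamped between the new default (2) and the old one (100).
function inferBatchSize(freeBytes: number = os.freemem()): number {
  // Budget at most a quarter of free memory for the upgrade.
  const affordable = Math.floor(freeBytes / 4 / ESTIMATED_BYTES_PER_POLICY);
  return Math.min(100, Math.max(2, affordable));
}
```

A small instance would then fall back to the conservative batch size of 2, while larger instances could upgrade faster.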

Also, regarding your offline question about when we trigger those upgrades: it happens when we need a new feature deployed to all agent policies. For example, in 8.8 the schema version was bumped to introduce agent protection features to the agent policy.
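The trigger described above can be sketched like this. The field name `schema_version`, the constant, and the comparison helper are all hypothetical names for illustration, not the actual Fleet identifiers:

```typescript
// Hypothetical: the schema version shipped with the current Kibana release.
const LATEST_SCHEMA_VERSION = '8.8.0';

interface StoredPolicy {
  id: string;
  schema_version?: string; // version the policy was last saved with
}

// On Kibana start, policies whose stored schema version is older than the
// current one are selected for the batched upgrade.
function needsSchemaUpgrade(policy: StoredPolicy): boolean {
  // A missing version predates schema versioning, so it needs an upgrade.
  if (!policy.schema_version) return true;
  const [stored, latest] = [policy.schema_version, LATEST_SCHEMA_VERSION].map(
    (v) => v.split('.').map(Number)
  );
  for (let i = 0; i < 3; i++) {
    if (stored[i] !== latest[i]) return stored[i] < latest[i];
  }
  return false;
}
```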

@nchaulet nchaulet added the v8.9.0 label Jul 5, 2023
@nchaulet nchaulet merged commit 8ef1287 into elastic:main Jul 5, 2023
@nchaulet nchaulet deleted the fix-default-batch-size branch July 5, 2023 13:49
kibanamachine pushed a commit to kibanamachine/kibana that referenced this pull request Jul 5, 2023
@kibanamachine
Contributor

💚 All backports created successfully

Status Branch Result
8.9

Note: Successful backport PRs will be merged automatically after passing CI.

Questions?

Please refer to the Backport tool documentation

kibanamachine added a commit that referenced this pull request Jul 5, 2023
# Backport

This will backport the following commits from `main` to `8.9`:
- [[Fleet] Change default batch size
(#161249)](#161249)

<!--- Backport version: 8.9.7 -->

### Questions ?
Please refer to the [Backport tool
documentation](https://github.com/sqren/backport)

<!--BACKPORT [{"author":{"name":"Nicolas
Chaulet","email":"nicolas.chaulet@elastic.co"},"sourceCommit":{"committedDate":"2023-07-05T13:49:38Z","message":"[Fleet]
Change default batch size
(#161249)","sha":"8ef128701757ad24617e0c97ab8e0a186bc87ec2","branchLabelMapping":{"^v8.10.0$":"main","^v(\\d+).(\\d+).\\d+$":"$1.$2"}},"sourcePullRequest":{"labels":["release_note:skip","Team:Fleet","v8.9.0","v8.10.0"],"number":161249,"url":"https://github.com/elastic/kibana/pull/161249","mergeCommit":{"message":"[Fleet]
Change default batch size
(#161249)","sha":"8ef128701757ad24617e0c97ab8e0a186bc87ec2"}},"sourceBranch":"main","suggestedTargetBranches":["8.9"],"targetPullRequestStates":[{"branch":"8.9","label":"v8.9.0","labelRegex":"^v(\\d+).(\\d+).\\d+$","isSourceBranch":false,"state":"NOT_CREATED"},{"branch":"main","label":"v8.10.0","labelRegex":"^v8.10.0$","isSourceBranch":true,"state":"MERGED","url":"https://github.com/elastic/kibana/pull/161249","number":161249,"mergeCommit":{"message":"[Fleet]
Change default batch size
(#161249)","sha":"8ef128701757ad24617e0c97ab8e0a186bc87ec2"}}]}]
BACKPORT-->

Co-authored-by: Nicolas Chaulet <nicolas.chaulet@elastic.co>
@@ -519,6 +520,17 @@ export const AgentListPage: React.FunctionComponent<{}> = () => {
<EuiSpacer size="l" />
</>
)}
{/* TODO serverless agent soft limit */}
{showUnhealthyCallout && (
Contributor

@nchaulet Is this an accidental change? I am seeing a double notification on Fleet UI on main:

[Screenshot: duplicate notification shown in the Fleet UI]

kilfoyle added a commit that referenced this pull request Jul 12, 2023
Adding #161249 (Kibana can run out
of memory during an upgrade when there are many Fleet agent policies in
place) to known issues for 8.8.x.

---------

Co-authored-by: David Kilfoyle <41695641+kilfoyle@users.noreply.github.com>
kilfoyle added a commit to kilfoyle/kibana that referenced this pull request Jul 12, 2023
Adding elastic#161249 (Kibana can run out
of memory during an upgrade when there are many Fleet agent policies in
place) to known issues for 8.8.x.

---------

Co-authored-by: David Kilfoyle <41695641+kilfoyle@users.noreply.github.com>
kilfoyle pushed a commit to kilfoyle/kibana that referenced this pull request Jul 12, 2023
Adding elastic#161249 (Kibana can run out
of memory during an upgrade when there are many Fleet agent policies in
place) to known issues for 8.8.x.

---------

Co-authored-by: David Kilfoyle <41695641+kilfoyle@users.noreply.github.com>
(cherry picked from commit e5cce01)
kilfoyle added a commit to kilfoyle/kibana that referenced this pull request Jul 12, 2023
Adding elastic#161249 (Kibana can run out
of memory during an upgrade when there are many Fleet agent policies in
place) to known issues for 8.8.x.

---------

Co-authored-by: David Kilfoyle <41695641+kilfoyle@users.noreply.github.com>
Labels
release_note:skip Skip the PR/issue when compiling release notes Team:Fleet Team label for Observability Data Collection Fleet team v8.9.0 v8.10.0
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Fleet]: Kibana upgrade failed from 8.7.1>8.8.0 BC8 when multiple agent policies with integrations exist.
7 participants