Add doc for node-agent memory preserve #8167

Lyndon-Li · 2024-08-30T07:09:13Z

Partially fix issue #8138, add doc for node-agent memory preserve

codecov · 2024-08-30T07:19:33Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 59.10%. Comparing base (3408ffe) to head (43de32a).
Report is 21 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #8167      +/-   ##
==========================================
+ Coverage   59.05%   59.10%   +0.04%     
==========================================
  Files         364      365       +1     
  Lines       30324    30336      +12     
==========================================
+ Hits        17909    17931      +22     
+ Misses      10972    10962      -10     
  Partials     1443     1443

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

kaovilai · 2024-08-30T18:17:08Z

site/content/docs/main/file-system-backup.md

@@ -641,6 +641,16 @@ Both the uploader and repository consume remarkable CPU/memory during the backup
 Velero node-agent uses [BestEffort as the QoS][14] for node-agent pods (so no CPU/memory request/limit is set), so that backups/restores wouldn't fail due to resource throttling in any cases.  
 If you want to constraint the CPU/memory usage, you need to [customize the resource limits][15]. The CPU/memory consumption is always related to the scale of data to be backed up/restored, refer to [Performance Guidance][16] for more details, so it is highly recommended that you perform your own testing to find the best resource limits for your data.   

+For Kopia path, some memory is preserved by the node-agent to avoid frequent memory allocations, therefore, after you run a file-system backup/restore, you won't see node-agent releases all the memory. There is a limit for the memory preservation, so the memory won't increase all the time. The limit varies from the number of CPU cores in the cluster nodes, as calculated below:  


Can we clarify how much if at all is released? Should there be timeout for preserved memory? If you only backup once every 6 months, you may rather spend time to reallocate memory next backup.

Added the clarification that there is no timeout for the preserved memory, so you won't see node-agent releases all the memory until it restarts.

Can we clarify how much if at all is released?

The released memory is unknown actually, because released memory = total allocated memory - preserved memory. While for total allocated memory, we've already clarified as below:
The CPU/memory consumption is always related to the scale of data to be backed up/restored, refer to [Performance Guidance][16] for more details, so it is highly recommended that you perform your own testing to find the best resource limits for your data.

Should there be timeout for preserved memory?

Yes, it is more rational to have a smarter mechanism instead of preserve the memory forever. But whether a timeout is ideal enough, we need to further consider.
At present, we just document it and leave it as is. We think it is not a high priority task, reasons:

It happens to fs-backup only from 1.15 on, because data movers will not run in the long-running node-agent pods.

The preserved memory won't reach to the limit very easily, normally it is less than the limit

The backup is usually a scheduled task, e.g., one/several per day, so the preserved memory is normally effective

site/content/docs/main/file-system-backup.md

Signed-off-by: Lyndon-Li <lyonghui@vmware.com>

github-actions bot added the Documentation label Aug 30, 2024

Lyndon-Li force-pushed the node-agent-memory-preserve-doc branch from 7be784d to c540104 Compare August 30, 2024 07:11

github-actions bot added the has-changelog label Aug 30, 2024

Lyndon-Li marked this pull request as ready for review August 30, 2024 07:12

github-actions bot requested review from blackpiglet and shubham-pampattiwar August 30, 2024 07:12

github-actions bot assigned Lyndon-Li Aug 30, 2024

blackpiglet previously approved these changes Aug 30, 2024

View reviewed changes

kaovilai reviewed Aug 30, 2024

View reviewed changes

Lyndon-Li dismissed blackpiglet’s stale review via 138c4d9 September 2, 2024 02:24

Lyndon-Li force-pushed the node-agent-memory-preserve-doc branch 2 times, most recently from 138c4d9 to 6a9a827 Compare September 2, 2024 02:40

Lyndon-Li requested review from blackpiglet and kaovilai September 2, 2024 02:42

blackpiglet previously approved these changes Sep 2, 2024

View reviewed changes

shubham-pampattiwar requested changes Sep 3, 2024

View reviewed changes

site/content/docs/main/file-system-backup.md Outdated Show resolved Hide resolved

add doc for node-agent memory preserve

43de32a

Signed-off-by: Lyndon-Li <lyonghui@vmware.com>

Lyndon-Li dismissed blackpiglet’s stale review via 43de32a September 9, 2024 05:39

Lyndon-Li force-pushed the node-agent-memory-preserve-doc branch from 6a9a827 to 43de32a Compare September 9, 2024 05:39

Lyndon-Li requested review from shubham-pampattiwar and blackpiglet September 9, 2024 05:39

blackpiglet approved these changes Sep 9, 2024

View reviewed changes

shubham-pampattiwar approved these changes Sep 9, 2024

View reviewed changes

shubham-pampattiwar merged commit a19cf56 into vmware-tanzu:main Sep 9, 2024
45 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add doc for node-agent memory preserve #8167

Add doc for node-agent memory preserve #8167

Lyndon-Li commented Aug 30, 2024

codecov bot commented Aug 30, 2024 •

edited

Loading

kaovilai Aug 30, 2024

Lyndon-Li Sep 2, 2024 •

edited

Loading

Lyndon-Li Sep 2, 2024 •

edited

Loading

Lyndon-Li Sep 2, 2024

Add doc for node-agent memory preserve #8167

Add doc for node-agent memory preserve #8167

Conversation

Lyndon-Li commented Aug 30, 2024

codecov bot commented Aug 30, 2024 • edited Loading

Codecov Report

kaovilai Aug 30, 2024

Choose a reason for hiding this comment

Lyndon-Li Sep 2, 2024 • edited Loading

Choose a reason for hiding this comment

Lyndon-Li Sep 2, 2024 • edited Loading

Choose a reason for hiding this comment

Lyndon-Li Sep 2, 2024

Choose a reason for hiding this comment

codecov bot commented Aug 30, 2024 •

edited

Loading

Lyndon-Li Sep 2, 2024 •

edited

Loading

Lyndon-Li Sep 2, 2024 •

edited

Loading