Mem limiter #368

DmitriyMusatkin · 2023-11-09T17:45:26Z

Issue #, if available:
aws/aws-sdk-java-v2#4034

Description of changes:

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

source/s3_buffer_pool.c

source/s3_client.c

include/aws/s3/private/s3_buffer_pool.h

source/s3_buffer_pool.c

graebm

I didn't get very far, so some comments may be uninformed. I'll continue tomorrow...

include/aws/s3/s3_client.h

include/aws/s3/private/s3_buffer_pool.h

include/aws/s3/private/s3_meta_request_impl.h

codecov-commenter · 2023-11-10T08:22:39Z

Codecov Report

Merging #368 (a6c69c2) into main (5fe1c3b) will increase coverage by 0.09%.
The diff coverage is 91.05%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #368      +/-   ##
==========================================
+ Coverage   88.45%   88.55%   +0.09%     
==========================================
  Files          19       20       +1     
  Lines        5120     5348     +228     
==========================================
+ Hits         4529     4736     +207     
- Misses        591      612      +21

Files	Coverage Δ
source/s3.c	`95.65% <ø> (ø)`
source/s3_auto_ranged_put.c	`92.70% <100.00%> (+0.09%)`	⬆️
source/s3_chunk_stream.c	`76.87% <100.00%> (ø)`
source/s3_meta_request.c	`92.82% <100.00%> (-0.10%)`	⬇️
source/s3_platform_info.c	`45.33% <ø> (ø)`
source/s3_request.c	`95.25% <100.00%> (+0.04%)`	⬆️
source/s3_auto_ranged_get.c	`97.61% <80.00%> (-0.64%)`	⬇️
source/s3_client.c	`88.91% <84.61%> (-0.09%)`	⬇️
source/s3_buffer_pool.c	`91.82% <91.82%> (ø)`

... and 1 file with indirect coverage changes

include/aws/s3/private/s3_request.h

source/s3_auto_ranged_get.c

source/s3_client.c

source/s3_buffer_pool.c

include/aws/s3/private/s3_buffer_pool.h

source/s3_buffer_pool.c

source/s3_client.c

graebm · 2023-11-20T21:24:09Z

source/s3_client.c

+    aws_s3_client_lock_synced_data(client);
+    client->synced_data.trim_buffer_pool_task_scheduled = false;
+    aws_s3_client_unlock_synced_data(client);


we could move trim_buffer_pool_task_scheduled from synced_data to threaded_data and not bother with locks here, right? we only ever schedule this and run it from the client-work-thread?

Co-authored-by: Michael Graeb <graebm@amazon.com>

graebm

fix & ship

graebm · 2023-11-21T18:23:21Z

source/s3_buffer_pool.c

+struct aws_s3_buffer_pool_ticket *aws_s3_buffer_pool_reserve(struct aws_s3_buffer_pool *buffer_pool, size_t size) {
+    AWS_PRECONDITION(buffer_pool);
+
+    if (buffer_pool->has_reservation_hold) {
+        return NULL;
+    }
+
+    if (size == 0) {
+        AWS_LOGF_ERROR(AWS_LS_S3_CLIENT, "Could not reserve from buffer pool. 0 is not a valid size.");
+        aws_raise_error(AWS_ERROR_INVALID_ARGUMENT);
+        return NULL;


Real code never checks the error-code from aws_s3_buffer_pool_reserve(). Real code assumes it failed due to the memory limit being temporarily exceeded.

so if size was 0, the meta-request would get stuck perpetually trying to acquire a ticket

similarly, if size > mem_limit the meta-request would be stuck perpetually trying to acquire a ticket. I realize you adjusted the client's max_part_size to make this impossible, but it might be worth an ASSERT or FATAL_ASSERT here in case client code changes in the future and something slips through

Anyway, we should probably do one of these things:

real code calling aws_s3_buffer_pool_reserve() continues to assume that it can only fail when the mem limit is temporarily exceeded. And the reserve() function here FATAL_ASSERTs that the size is legal (not 0, not exceeding mem_limit)

OR real code calling aws_s3_buffer_pool_reserve() checks to differentiate the expected "waiting for memory" from other errors that terminate the meta-request

yeah, good point. size == 0 or > mem_lim is really an indication that something wrong went on before, so for simplicity sake, i think fatal assert makes more sense.

source/s3_client.c

… limit (#429) **Issue:** `aws_s3_meta_request_write()` must write to a buffer immediately if the data is less than part-size. Currently, it uses a buffer hooked up the to default allocator ([code here](https://github.com/awslabs/aws-c-s3/blob/cc06c419448b40417caa7b587f61bb4d8b4c08c1/source/s3_meta_request.c#L2260-L2262)). We'd like to get these buffers from the [buffer-pool](#368), to reduce memory fragmentation and resident set size (RSS). The problem is: the buffer-pool maintains strict memory limits, and won't allow further allocations when that limit is hit. But `aws_s3_meta_request_write()` MUST get a buffer immediately, or else the system can deadlock (see description in PR #418) **Description of changes:** Add `aws_s3_buffer_pool_acquire_forced_buffer()`. These buffers can be created even if they exceed the memory limit. **Future work:** Modify `aws_s3_meta_request_write()` to use this new function **Additional thoughts:** - Anyone using `aws_s3_meta_request_write()` should limit the total number of uploads like: `max-uploads = memory-limit / part-size`. That was the case even before this PR.

DmitriyMusatkin added 10 commits November 6, 2023 00:17

initial buffer reuse

1e44954

test fixes

0f59f52

more test fixes

511ce35

lets not be too fancy

a524b08

enable for puts

5804c39

bump limits

bd17c6d

dont reinit buffer

690fc85

fixes

be631c6

remove logging

d1df7ce

cleaning up

bc9b126

JonathanHenson reviewed Nov 9, 2023

View reviewed changes

source/s3_buffer_pool.c Outdated Show resolved Hide resolved

source/s3_buffer_pool.c Outdated Show resolved Hide resolved

source/s3_buffer_pool.c Outdated Show resolved Hide resolved

source/s3_client.c Outdated Show resolved Hide resolved

DmitriyMusatkin added 5 commits November 9, 2023 11:31

test fixes

891cd90

32 bit fix

c695744

test fixes

6c9e6ae

fix small buffer for gets

e7166e9

dont cancel trim

9c40011

TingDaoK reviewed Nov 9, 2023

View reviewed changes

include/aws/s3/private/s3_buffer_pool.h Outdated Show resolved Hide resolved

source/s3_buffer_pool.c Show resolved Hide resolved

source/s3_buffer_pool.c Outdated Show resolved Hide resolved

DmitriyMusatkin added 3 commits November 9, 2023 16:03

move around trim canceling

5207457

typo

6f7286c

lets check metrics inside synced block

22f38eb

graebm reviewed Nov 10, 2023

View reviewed changes

data race

43f12e6

graebm reviewed Nov 10, 2023

View reviewed changes

DmitriyMusatkin added 5 commits November 10, 2023 13:29

addressing comments

3d467d8

logging

a7b9e7f

low mem limits on 32

ca9e597

typo

b0a9d83

add more logging

d9197dc

DmitriyMusatkin added 15 commits November 18, 2023 00:51

addressing comments

cb943cd

address comments

4f1488b

fix test

b450054

another test

db19f07

data race

f3e896a

data race

1a8d329

reenable trim

c600680

tweak buffer

849d655

fix build error

e0b0ab7

lint and update docs

7f14806

more lint

3739d1c

adjust validation

00cc3c1

trim test

ed5baa3

lint

ab23ab6

net test case

8fd62e4

graebm reviewed Nov 20, 2023

View reviewed changes

DmitriyMusatkin and others added 5 commits November 20, 2023 23:58

Update source/s3_buffer_pool.c

52ffae7

Co-authored-by: Michael Graeb <graebm@amazon.com>

addressing comments

fccca7c

addressing comments

1630751

lint, fix docs

0028246

even more lint

fb3e351

graebm approved these changes Nov 21, 2023

View reviewed changes

DmitriyMusatkin added 3 commits November 21, 2023 15:27

address comments

79977a5

remove 0 size buffer test

bb5c139

typo

a6c69c2

DmitriyMusatkin merged commit f961971 into main Nov 21, 2023
30 checks passed

DmitriyMusatkin deleted the mem_ticket branch November 21, 2023 21:44

DmitriyMusatkin changed the title ~~(WIP) Mem limiter~~ Mem limiter Nov 21, 2023

graebm mentioned this pull request May 8, 2024

Buffer-pool allows "forced" buffers, which don't count against memory limit #429

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Mem limiter #368

Mem limiter #368

DmitriyMusatkin commented Nov 9, 2023 •

edited by graebm

Loading

graebm left a comment

codecov-commenter commented Nov 10, 2023 •

edited

Loading

graebm Nov 20, 2023

graebm left a comment

graebm Nov 21, 2023

DmitriyMusatkin Nov 21, 2023

Mem limiter #368

Mem limiter #368

Conversation

DmitriyMusatkin commented Nov 9, 2023 • edited by graebm Loading

graebm left a comment

Choose a reason for hiding this comment

codecov-commenter commented Nov 10, 2023 • edited Loading

Codecov Report

graebm Nov 20, 2023

Choose a reason for hiding this comment

graebm left a comment

Choose a reason for hiding this comment

graebm Nov 21, 2023

Choose a reason for hiding this comment

DmitriyMusatkin Nov 21, 2023

Choose a reason for hiding this comment

DmitriyMusatkin commented Nov 9, 2023 •

edited by graebm

Loading

codecov-commenter commented Nov 10, 2023 •

edited

Loading