
Add DENYOOM flag to SCRIPT LOAD and make it fail on OOM #866

Open
wants to merge 1 commit into base: unstable

Conversation

enjoy-binbin
Member

Currently we can load a lot of Lua scripts to bypass maxmemory; this adds a DENYOOM flag to SCRIPT LOAD so it fails on OOM.


Signed-off-by: Binbin <binloveplay1314@qq.com>
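
As an aside for illustration (not code from this PR), here is a minimal self-contained sketch of the kind of check a DENYOOM flag implies: commands carrying the flag get rejected with an -OOM error once used memory exceeds maxmemory. All names here (CMD_DENYOOM, command_t, the fake memory counters) are simplified stand-ins rather than the actual valkey internals.

```c
/* Toy model of an OOM gate for commands flagged DENYOOM; a simplified
 * stand-in, not the actual valkey implementation.
 * Build: cc -o oom_gate oom_gate.c && ./oom_gate */
#include <stdio.h>
#include <stddef.h>

#define CMD_DENYOOM (1 << 0) /* hypothetical flag bit for this sketch */

typedef struct {
    const char *name;
    int flags;
} command_t;

/* Pretend server state: already above the configured limit. */
static size_t used_memory = 120u * 1024 * 1024; /* 120 MB */
static size_t maxmemory   = 100u * 1024 * 1024; /* 100 MB */

/* Return 0 if the command may run, -1 if it must be rejected with -OOM. */
static int oom_gate(const command_t *cmd) {
    int over_limit = maxmemory != 0 && used_memory > maxmemory;
    return (over_limit && (cmd->flags & CMD_DENYOOM)) ? -1 : 0;
}

int main(void) {
    /* What this PR proposes: SCRIPT LOAD carries the flag, so it is rejected. */
    command_t script_load   = {"SCRIPT LOAD",   CMD_DENYOOM};
    command_t script_exists = {"SCRIPT EXISTS", 0};

    printf("%s -> %s\n", script_load.name,
           oom_gate(&script_load) ? "rejected with -OOM" : "allowed");
    printf("%s -> %s\n", script_exists.name,
           oom_gate(&script_exists) ? "rejected with -OOM" : "allowed");
    return 0;
}
```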

codecov bot commented Aug 2, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 70.24%. Comparing base (b728e41) to head (f6049db).
Report is 5 commits behind head on unstable.

Additional details and impacted files
@@             Coverage Diff              @@
##           unstable     #866      +/-   ##
============================================
- Coverage     70.47%   70.24%   -0.24%     
============================================
  Files           112      112              
  Lines         61467    61467              
============================================
- Hits          43320    43177     -143     
- Misses        18147    18290     +143     
Files Coverage Δ
src/commands.def 100.00% <ø> (ø)

... and 13 files with indirect coverage changes

madolson previously approved these changes Aug 7, 2024
@madolson
Member

madolson commented Aug 7, 2024

This makes sense to me. @soloestoy, do you know of any use case where this would cause an issue? Multi-exec with script load should fail. I guess it's a breaking change, so we probably need a major version change.

madolson added the breaking-change (Indicates a possible backwards incompatible change) and release-notes (This issue should get a line item in the release notes) labels Aug 7, 2024
madolson dismissed their stale review August 7, 2024 22:56

Realized something

@madolson
Member

madolson commented Aug 7, 2024

I suppose I realized one case where this might fail. If someone is trying to load a script that cleans up data, it might fail unnecessarily. There are also previous cases where multi-execs would have succeeded but would now fail.

madolson added the major-decision-pending (Major decision pending by TSC team) label Aug 7, 2024
@soloestoy
Member

I think this would be hard for users to understand. Currently, when maxmemory is exceeded, data eviction is triggered, so DENYOOM is mainly aimed at data write operations. However, the scripts loaded by SCRIPT LOAD are not data and are not evicted, which makes this difficult to explain.

This also reminds me of a feature I worked on before, redis/redis#5454, where, in order to prevent clients from queuing too many commands in a MULTI context and causing uncontrolled memory growth, we marked transactions as dirty when maxmemory was exceeded. However, it was eventually reverted in redis/redis#12961 because it was hard for users to understand.

Maybe we can take a different approach by adding maxmemory-scripts to limit the size of cached scripts. It would only be allowed to use a certain percentage of maxmemory, and once exceeded, the scripts would be evicted (even those loaded by script load).
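
For illustration only (not part of this PR or of valkey), a rough sketch of the eviction idea described above: keep cached scripts in insertion order and drop the oldest ones once their total size exceeds a hypothetical maxmemory-scripts budget. The data structure and names are invented for the sketch.

```c
/* Toy sketch of a "maxmemory-scripts" style budget: evict the oldest
 * cached scripts once their total size exceeds the limit.
 * Hypothetical; not how valkey stores its script cache. */
#include <stdio.h>
#include <stdlib.h>

typedef struct script {
    char sha[9];          /* shortened id for the sketch */
    size_t body_len;      /* bytes used by the cached body */
    struct script *next;
} script;

static script *cache_head = NULL;   /* oldest first */
static script *cache_tail = NULL;
static size_t cache_bytes = 0;

static void evict_until_under(size_t maxmemory_scripts) {
    while (cache_bytes > maxmemory_scripts && cache_head) {
        script *victim = cache_head;
        cache_head = victim->next;
        if (!cache_head) cache_tail = NULL;
        cache_bytes -= victim->body_len;
        printf("evicting script %s (%zu bytes)\n", victim->sha, victim->body_len);
        free(victim);
    }
}

static void script_load(const char *sha, size_t body_len, size_t maxmemory_scripts) {
    script *s = malloc(sizeof(*s));
    snprintf(s->sha, sizeof(s->sha), "%s", sha);
    s->body_len = body_len;
    s->next = NULL;
    if (cache_tail) cache_tail->next = s; else cache_head = s;
    cache_tail = s;
    cache_bytes += body_len;
    evict_until_under(maxmemory_scripts);
}

int main(void) {
    size_t budget = 1000;                  /* pretend maxmemory-scripts = 1000 bytes */
    script_load("aaaaaaaa", 600, budget);
    script_load("bbbbbbbb", 300, budget);
    script_load("cccccccc", 400, budget);  /* total hits 1300, so the oldest is evicted */
    printf("cached bytes now: %zu\n", cache_bytes);
    return 0;
}
```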

@madolson
Member

> Maybe we can take a different approach by adding maxmemory-scripts to limit the size of cached scripts. It would only be allowed to use a certain percentage of maxmemory, and once exceeded, the scripts would be evicted (even those loaded by script load).

I guess I want to ask how much of a real problem this is. AFAIK we at AWS have never seen this manifest as a problem, but maybe others have?

@soloestoy
Member

I have seen many instances where number_of_cached_scripts reaches into the millions, and used_memory_scripts_eval consumes tens of gigabytes of memory, lol. When a feature is unrestricted, it is prone to abuse.

@enjoy-binbin
Member Author

enjoy-binbin commented Aug 15, 2024

> I have seen many instances where number_of_cached_scripts reaches into the millions, and used_memory_scripts_eval consumes tens of gigabytes of memory, lol. When a feature is unrestricted, it is prone to abuse.

yeah, I have seen this too (both from eval scripts; I haven't seen it with script load yet), which is why we now have eval script eviction.

btw, we now have the following issues:

  1. Lua does not use jemalloc, and we do not count Lua memory in used_memory (see the sketch after this list)
  2. abuse of script loading (so maxmemory-scripts seems like a good idea, but I do think we need a DENYOOM flag when maxmemory-scripts is 0, i.e. unlimited)
  3. to ensure script atomicity, scripts can bypass maxmemory (data memory or Lua heap memory):
     EVAL "for i=1,10000000 do redis.call('SET', 'key-kkkk:' .. i, 'value-tttt:' .. i) end" 0
     EVAL "for i=1,10000000 do redis.pcall('SET', 'key-kkkk:' .. i, 'value-tttt:' .. i) end" 0
     EVAL "local a={}; local i; for i=1, 30000000 do a[i]=i; end" 0

@madolson
Member

madolson commented Nov 4, 2024

> I have seen many instances where number_of_cached_scripts reaches into the millions, and used_memory_scripts_eval consumes tens of gigabytes of memory, lol. When a feature is unrestricted, it is prone to abuse.

We have also seen this, but this is almost always from EVAL abuse, not SCRIPT LOAD + EVALSHA abuse. It looks like binbin also mentioned the same thing.

> abuse of script loading (so maxmemory-scripts seems like a good idea, but I do think we need a DENYOOM flag when maxmemory-scripts is 0, i.e. unlimited)

I think the concern in the past was that many clients, on new connection establishment, will send SCRIPT LOAD with scripts that are meant to reduce memory usage, but those would error out because the SCRIPT LOAD fails, so they can never reduce memory. I would be more inclined to add a config here so that managed valkey providers have an operational config they can change, but the default doesn't change.
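
One way to read that suggestion, purely as an illustration (the config name and defaults below are invented): gate the new rejection behind a setting that defaults to off, so clients that send SCRIPT LOAD on connect keep today's behavior unless an operator opts in.

```c
/* Toy sketch of an opt-in config gating the OOM rejection for SCRIPT LOAD.
 * "script-load-deny-oom" is an invented name; the default keeps today's behavior. */
#include <stdio.h>
#include <stddef.h>

static int script_load_deny_oom = 0;               /* hypothetical config, default off */
static size_t used_memory = 120, maxmemory = 100;  /* pretend values, in MB */

static const char *handle_script_load(void) {
    if (script_load_deny_oom && maxmemory != 0 && used_memory > maxmemory)
        return "(error) OOM command not allowed when used memory > 'maxmemory'";
    return "\"<sha1 of the cached script>\"";
}

int main(void) {
    printf("default behavior: %s\n", handle_script_load());
    script_load_deny_oom = 1;                      /* operator opts in */
    printf("after opt-in:     %s\n", handle_script_load());
    return 0;
}
```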

@enjoy-binbin
Member Author

> I would be more inclined to add a config here so that managed valkey providers have an operational config they can change, but the default doesn't change.

Sounds good to me. What kind of config are we talking about here? maxmemory-scripts to limit script memory (including the server part and the Lua VM part)?
