Feature request

It would be nice to be able to disable mmap when loading models, to increase inference speed. I am only guessing that mmap is to blame, since my reported memory usage stays very low when loading a large LLaMA-based model.

Related issue in llama.cpp

Motivation

I am getting low inference speeds (and low reported memory usage) when loading large LLaMA-based models such as llama-30b.ggmlv3.q5_K_M. The ability to disable mmap could help improve this.

Your contribution

None.

---

Follow-up from the author: Looks like my low inference speed was due to exceeding my RAM limit and going to swap. I didn't realize it at the time because the OS does not report mmapped pages as process RAM usage. Closing, as I no longer see a reason for someone to disable mmap.
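For context on the author's observation, here is a minimal Python sketch of why mmap keeps reported memory low. It uses the standard-library `mmap` module on a throwaway file, not llama.cpp's actual loader (which is C/C++): mapped pages are only read from disk when first touched, so resident-memory (RSS) counters stay low until the data is actually accessed — and once the mapped working set exceeds physical RAM, the OS evicts and re-reads pages, which shows up as slow inference rather than high RAM usage.

```python
import mmap
import os
import tempfile

# Create a 1 MiB file as a stand-in for a model weights file.
with tempfile.NamedTemporaryFile(delete=False) as f:
    f.write(b"x" * (1 << 20))
    path = f.name

with open(path, "rb") as f:
    # Mapping the file reserves address space but does not read it into
    # RAM, which is why memory-usage tools show almost nothing here.
    with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
        first = mm[0]   # touching a byte faults in only that page
        size = len(mm)  # full file size is addressable immediately

os.unlink(path)
print(first, size)  # 120 1048576  (ord("x"), 1 MiB)
```

If this issue is against llama-cpp-python, its `Llama` constructor exposes a `use_mmap` flag that forces a conventional read into RAM when set to `False`; whether that knob is plumbed through in the project this issue was filed against is an assumption on my part.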