-
-
Notifications
You must be signed in to change notification settings - Fork 3k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
update ollama performance mode (#2874)
- Loading branch information
1 parent
af70342
commit a51de73
Showing
2 changed files
with
19 additions
and
8 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
a51de73
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@timothycarambat just wondering why mlock should be false?
One impact to this is, if using the same model for both Agent & regular chat on Ollama, is this will cause the model to unload and re-load because this parameter is different.
My vote would be for mlock to be consistant, off everywhere or on everywhere, to prevent unloading.
a51de73
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Correct, this was a leftover mistake during testing - will patch now
4b2bb52