Describe the bug
I managed to load a local Mistral 7B model and was able to chat back and forth with the interpreter, but after I send a message it takes an extremely long time before tokens begin streaming, even though I get decent tokens/s once streaming starts.
On closer inspection of memory usage in Task Manager, I see that every time I press enter and send a message, the program loads a brand new Ooba server and a GGUF model. I confirmed this by adding print("Ooba starting and model loading!") before line 84 in Ooba's llm.py, and by observing that an additional Python process pops up in Task Manager and occupies 8 GB of RAM after every message I send.
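For context, this pattern (long delay before streaming, plus a fresh 8 GB process per message) is what you would see if the server were constructed inside the per-message path instead of being cached. A minimal sketch of the once-per-session behavior I would expect; the class and function names here are illustrative stand-ins, not Open Interpreter's or Ooba's actual API:

```python
# Sketch of once-per-session loading. FakeOobaServer is a stand-in for the
# Ooba server process that loads a GGUF model; it is NOT the real API.
load_count = 0

class FakeOobaServer:
    def __init__(self):
        global load_count
        load_count += 1  # each construction simulates an ~8 GB model load

    def generate(self, message):
        return f"response to: {message}"

_server = None  # cached across messages for the whole session

def get_server():
    # Expected behavior: construct (and load the model) only once per session.
    global _server
    if _server is None:
        _server = FakeOobaServer()
    return _server

def chat(message):
    # Reusing the cached server avoids the long delay before streaming starts.
    return get_server().generate(message)

chat("hello")
chat("how are you?")
print(load_count)  # prints 1 - the model was loaded only once
```

The bug I'm seeing behaves as if the equivalent of FakeOobaServer() runs on every call to chat, not just the first.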
Please see the screenshots for details.
Reproduce
1. Run interpreter --local
2. Chat with it and note how long it takes before the response begins streaming.
3. Monitor your RAM usage; the model is loaded into RAM every time you send a message.
Expected behavior
The model should load only once per session.
Screenshots
Open Interpreter version
0.1.10
Python version
3.11.6
Operating System name and version
Windows 10
Additional context
I am also getting a NoneType error for interpreter.procedures, for some reason, but I don't think these issues are related. I tried putting procedures_db.json in the same directory as get_relevant_procedures_string.py and loading it manually to make sure it was present. The NoneType error went away, but a new model is still loaded after every message.
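As an aside, the manual workaround I used for the NoneType error amounted to loading the file defensively. A rough sketch of that idea; the path and the fallback value are guesses based on the error, not the project's actual code:

```python
import json
import os

def load_procedures(path="procedures_db.json"):
    # Guard against a missing file so callers get a dict instead of None,
    # which is what appeared to trigger the NoneType error.
    if not os.path.exists(path):
        return {}
    with open(path, encoding="utf-8") as f:
        return json.load(f)
```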