Complete revamp of model loading to allow for more discrete control #1969

manyoso · 2024-02-15T18:36:44Z

Large revamp of model loading.

the user of the models loading behavior. Signed-off-by: Adam Treat <treat.adam@gmail.com>

cebtenzzre · 2024-02-15T21:33:39Z

gpt4all-chat/chatllm.cpp

@@ -170,7 +222,7 @@ bool ChatLLM::loadModel(const ModelInfo &modelInfo)
 #endif
        delete m_llModelInfo.model;
        m_llModelInfo.model = nullptr;
-        emit isModelLoadedChanged(false);
+        emit modelLoadingPercentageChanged(std::numeric_limits<float>::min());


I think there should be a comment explaining the significance of negative infinity here.

This is not negative infinite. It is the smallest possible positive floating point value closest to zero.

Oops, sometimes I forget that FLT_MIN has nothing to do with INT_MIN. I'm thinking there should be a brief comment like // small nonzero value to make it clear that it is arbitrary.

Added comments.

gpt4all-chat/main.qml

cebtenzzre · 2024-02-15T21:57:03Z

Also, attempting to switch models while a model is loading is allowed - it changes the model name but not the progress value, and it ends up queueing a series of model loads. We should either abort the model load via the progress callback, or prevent the user from doing this.

cebtenzzre · 2024-02-15T22:04:26Z

Is this dialog still necessary, when selecting a model that previously failed to load? Especially the "Model loading error..." at the top.

cebtenzzre · 2024-02-15T22:08:02Z

This combo box is too wide now that the Clone/Remove buttons are gone:

Also, half of the generation settings are off the edge of the dialog (not visible).

edit: Apparently also seen in the 2.7.0 release on Windows, see the Discord.

Signed-off-by: Adam Treat <treat.adam@gmail.com>

manyoso · 2024-02-19T15:41:51Z

All of the above should be fixed with this newest version. Thanks for the quality review.

ThiloteE · 2024-02-20T01:55:59Z

Nitpick: Models with long names bleed into symbols.
Shouldn't prevent a merge, but wanted to mention it.

ThiloteE · 2024-02-20T02:35:13Z

Also, having multiple conversations with different models and switching between them causes long (few seconds long) "switching context" messages.
.
Is it because the prompt is evaluated on CPU immediatelly?

manyoso · 2024-02-20T13:58:37Z

Nitpick: Models with long names bleed into symbols. Shouldn't prevent a merge, but wanted to mention it.

Fixed.

manyoso · 2024-02-20T13:59:26Z

Also, having multiple conversations with different models and switching between them causes long (few seconds long) "switching context" messages.

This has to do with the save/restore of context that can be slow under vulkan. We're looking at ways to speed it up.

Signed-off-by: Adam Treat <treat.adam@gmail.com>

ThiloteE · 2024-02-20T14:59:01Z

Thank you. Can confirm it is much better now. To be honest though, adding slightly to the width wouldn't hurt, because most models on huggingface do have long model names, especially the mergers. Also, if users have multiple model cards for them (e.g. various system prompts in different languages) then those model names might become even longer to avoid confusion.

cebtenzzre · 2024-02-20T16:01:22Z

A. Things look wrong at the minimum window size - and in general, when the window width is less than 3/4 of my screen.

B. I liked the font size of the regenerate button better before:

Than it is now:

C.

This combo box is too wide now that the Clone/Remove buttons are gone:

This is still the case, the combo box goes off the edge of the dialog.

D. If you:

Select a model and chat with it
Unload the model
Instead of clicking reload, or selecting a different model, select the same model again
Your chat session has now been cleared??

E. The way we handle crashes during model loading is still broken - not only is there a dialog without OK/Cancel buttons that appears when the user explicitly tries to load the same model again, but the model switcher actually completely freezes now, and the app has to be restarted in order to continue. And you can never load that model again until you successfully load a different one.

Signed-off-by: Adam Treat <treat.adam@gmail.com>

manyoso · 2024-02-20T17:06:45Z

I don't know what you mean by "This is still the case, the combo box goes off the edge of the dialog." as I don't see that.

The sizing issues with min width and so on are not germane to this PR. I'd like to address them in another PR. Moreover, they are going to go away when the new UI revamp comes so I don't want to spend much time on them.

manyoso · 2024-02-20T17:07:46Z

I've fixed D. I'm not sure about E and what we want to do. It should probably be addressed in a new PR.

cebtenzzre · 2024-02-20T19:58:00Z

The sizing issues with min width and so on are not germane to this PR. I'd like to address them in another PR. Moreover, they are going to go away when the new UI revamp comes so I don't want to spend much time on them.

Ok. But the clear chat button didn't overlap before because the model selection box was smaller, so this is a regression, even if you think it's not important.

I don't know what you mean by "This is still the case, the combo box goes off the edge of the dialog." as I don't see that.

On my Mac:

Notice how:

The "Generation Settings" header that is supposed to be centered is now shifted to the right
The right column of generation settings is cut off (as in, I cannot see the right half of each inputbox)
The expanded combobox to select a model stretches outside of the settings dialog and onto the dimmed background behind it (to the right)

manyoso · 2024-02-20T21:31:38Z

Ok. But the clear chat button didn't overlap before because the model selection box was smaller, so this is a regression, even if you think it's not important.

Got it, thanks.

On my Mac:

Yeah, I don't see anything like this. I'll check on my mac.

barnett-yuxiang

LGTM

manyoso · 2024-02-21T14:23:08Z

I cannot reproduce what you see on the mac with the model settings. Further, this change does not seem to have any impact on the model settings or that combo box. I'm not sure why you see what you see @cebtenzzre

Signed-off-by: Adam Treat <treat.adam@gmail.com>

manyoso · 2024-02-21T16:10:50Z

The model loading error you get when app crashes and we display on reload is now gone.

ThiloteE · 2024-03-12T09:31:27Z

fixed #1660

Complete revamp of model loading to allow for more discreet control by

0470f9e

the user of the models loading behavior. Signed-off-by: Adam Treat <treat.adam@gmail.com>

manyoso requested a review from cebtenzzre February 15, 2024 18:36

cebtenzzre changed the title ~~Complete revamp of model loading to allow for more discreet control by~~ Complete revamp of model loading to allow for more discrete control Feb 15, 2024

cebtenzzre reviewed Feb 15, 2024

View reviewed changes

This comment was marked as resolved.

Sign in to view

Fixes for issues identified in review.

03970da

Signed-off-by: Adam Treat <treat.adam@gmail.com>

Increase padding for elided text in combo.

8c7357b

Signed-off-by: Adam Treat <treat.adam@gmail.com>

Don't erase context when reloading model by selection.

b8c4810

Signed-off-by: Adam Treat <treat.adam@gmail.com>

barnett-yuxiang reviewed Feb 21, 2024

View reviewed changes

manyoso added 3 commits February 21, 2024 09:54

Add comment to make this clear.

3e44a12

Signed-off-by: Adam Treat <treat.adam@gmail.com>

Make the reload/regenerate buttons a little bit larger font.

82fd4df

Signed-off-by: Adam Treat <treat.adam@gmail.com>

Don't try and detect model load error on startup.

87368e0

Signed-off-by: Adam Treat <treat.adam@gmail.com>

cebtenzzre approved these changes Feb 21, 2024

View reviewed changes

manyoso merged commit fa0a212 into main Feb 21, 2024
6 of 17 checks passed

ThiloteE mentioned this pull request Mar 12, 2024

Issue: When groing through chat history, the client attempts to load the entire model for each individual conversation. #1660

Closed

ThiloteE mentioned this pull request Jul 27, 2024

Keep selected model when creating "New chat" #850

Closed

10 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Complete revamp of model loading to allow for more discrete control #1969

Complete revamp of model loading to allow for more discrete control #1969

manyoso commented Feb 15, 2024

cebtenzzre Feb 15, 2024

manyoso Feb 19, 2024

cebtenzzre Feb 20, 2024

manyoso Feb 21, 2024

This comment was marked as resolved.

This comment was marked as resolved.

cebtenzzre commented Feb 15, 2024

cebtenzzre commented Feb 15, 2024

cebtenzzre commented Feb 15, 2024 •

edited

Loading

This comment was marked as resolved.

manyoso commented Feb 19, 2024

ThiloteE commented Feb 20, 2024

ThiloteE commented Feb 20, 2024

manyoso commented Feb 20, 2024 •

edited

Loading

manyoso commented Feb 20, 2024

ThiloteE commented Feb 20, 2024 •

edited

Loading

cebtenzzre commented Feb 20, 2024

manyoso commented Feb 20, 2024

manyoso commented Feb 20, 2024

cebtenzzre commented Feb 20, 2024 •

edited

Loading

manyoso commented Feb 20, 2024

barnett-yuxiang left a comment

manyoso commented Feb 21, 2024

manyoso commented Feb 21, 2024

ThiloteE commented Mar 12, 2024

Complete revamp of model loading to allow for more discrete control #1969

Complete revamp of model loading to allow for more discrete control #1969

Conversation

manyoso commented Feb 15, 2024

cebtenzzre Feb 15, 2024

Choose a reason for hiding this comment

manyoso Feb 19, 2024

Choose a reason for hiding this comment

cebtenzzre Feb 20, 2024

Choose a reason for hiding this comment

manyoso Feb 21, 2024

Choose a reason for hiding this comment

This comment was marked as resolved.

This comment was marked as resolved.

cebtenzzre commented Feb 15, 2024

cebtenzzre commented Feb 15, 2024

cebtenzzre commented Feb 15, 2024 • edited Loading

This comment was marked as resolved.

manyoso commented Feb 19, 2024

ThiloteE commented Feb 20, 2024

ThiloteE commented Feb 20, 2024

manyoso commented Feb 20, 2024 • edited Loading

manyoso commented Feb 20, 2024

ThiloteE commented Feb 20, 2024 • edited Loading

cebtenzzre commented Feb 20, 2024

manyoso commented Feb 20, 2024

manyoso commented Feb 20, 2024

cebtenzzre commented Feb 20, 2024 • edited Loading

manyoso commented Feb 20, 2024

barnett-yuxiang left a comment

Choose a reason for hiding this comment

manyoso commented Feb 21, 2024

manyoso commented Feb 21, 2024

ThiloteE commented Mar 12, 2024

cebtenzzre commented Feb 15, 2024 •

edited

Loading

manyoso commented Feb 20, 2024 •

edited

Loading

ThiloteE commented Feb 20, 2024 •

edited

Loading

cebtenzzre commented Feb 20, 2024 •

edited

Loading