Use a chat model instead of LLM for codestral completion #31
Conversation
The completion works better when using a chat model.

```ts
*/
const REQUEST_TIMEOUT = 3000;
const DEFAULT_PROMPT = `
```
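As an aside, a `REQUEST_TIMEOUT` like this is typically wired up with an `AbortController`. A hedged sketch of that pattern (the endpoint and body are placeholders, not the actual extension code):

```ts
// Sketch: abort the completion request if the provider has not
// answered within REQUEST_TIMEOUT milliseconds. Placeholder names;
// not the actual extension code.
async function fetchCompletion(endpoint: string, body: string): Promise<Response> {
  const controller = new AbortController();
  const timer = setTimeout(() => controller.abort(), REQUEST_TIMEOUT);
  try {
    return await fetch(endpoint, {
      method: 'POST',
      body,
      signal: controller.signal
    });
  } finally {
    clearTimeout(timer);
  }
}
```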
Looks like we can now update this PR to use the default prompts from #28?
Branch updated from 945ae53 to c6884b4.
Testing this locally, completions appear to be quite fast, although the response often includes code block delimiters. Over on #27 there is some logic to avoid discarding responses that include code block delimiters, and instead process the string to keep the suggestion without the backticks and language name. So maybe we could isolate that into some util functions so it can be reused across providers, as sketched below.
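A minimal sketch of what such a shared util could look like (the function name and regex are illustrative, not the actual #27 implementation):

```ts
/**
 * Strip Markdown code block delimiters from a completion response,
 * keeping only the suggested code. Hypothetical helper; not the
 * actual #27 implementation.
 */
export function stripCodeBlockDelimiters(completion: string): string {
  // A fully fenced response: optional language name after the opening fence.
  const fenced = completion.match(/^\s*```[\w-]*\n?([\s\S]*?)\n?```\s*$/);
  if (fenced) {
    return fenced[1];
  }
  // Fall back to trimming a dangling opening or closing fence.
  return completion
    .replace(/^\s*```[\w-]*\n?/, '')
    .replace(/\n?```\s*$/, '');
}
```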
We can also improve the default completion prompt to mention that responses should be formatted without code block delimiters.
I did some tests with that kind of prompt and it doesn't work; the model seems to ignore these instructions.
Yeah, strangely it seems to be returning code blocks quite often.

jupyterlite-ai-codestral-codeblocks.mov

Network response:

```json
{
  "object": "chat.completion",
  "created": 1738938116,
  "model": "codestral-latest",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "tool_calls": null,
        "content": "```python\ndf = pd.DataFrame({\n    'A': [1, 2, 3],\n    'B': [4, 5, 6]\n})\n```"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 88,
    "total_tokens": 125,
    "completion_tokens": 37
  }
}
```
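For reference, feeding that `content` field through the helper sketched earlier would recover just the code body (again, the helper name is illustrative):

```ts
// Applying the hypothetical helper from above to the fenced response.
const raw =
  "```python\ndf = pd.DataFrame({\n    'A': [1, 2, 3],\n    'B': [4, 5, 6]\n})\n```";
console.log(stripCodeBlockDelimiters(raw));
// df = pd.DataFrame({
//     'A': [1, 2, 3],
//     'B': [4, 5, 6]
// })
```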
Playing around with it a bit more locally, adding more instructions to the prompt seems to be helping a bit (see ai/src/llm-models/chrome-completer.ts, lines 11 to 20 at 640f525).
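The referenced lines aren't reproduced here, but a prompt along these lines (illustrative wording, not the exact chrome-completer.ts content) makes the no-delimiters expectation explicit:

```ts
// Illustrative completion prompt; the actual wording lives in
// ai/src/llm-models/chrome-completer.ts (lines 11 to 20 at 640f525).
const DEFAULT_PROMPT = `
You are a code completion engine.
Complete the code the user provides, returning only the new code.
Return plain code: no Markdown, no code block delimiters (\`\`\`),
and no language name.
Do not repeat code the user has already typed.
`;
```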
But there still seem to be some issues with completions being a little bit "off" by a few characters?

jupyterlite-ai-codestral-test-2.mov
Thanks for digging into it. Feel free to push your changes to the branch; otherwise I can do it.
I pushed a change to improve the default prompt used for completions, so it can also be useful for other providers.
@brichet if this looks good to you, happy to merge as it is right now.
Let's merge it, and thank you for the prompt update. I still get some unexpected suggestions from MistralAI, but that may be more of an issue with the model.

Peek.2025-02-10.11-51.webm
Langchainjs warns against using the text completion model and advises using the chat model instead, which may be more up to date (see the caution at https://js.langchain.com/docs/integrations/llms/). This PR replaces the text completion model with a chat model for Codestral completion.
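For reference, a minimal sketch of the switch described here, assuming the @langchain/mistralai package (option names may vary across versions, and the prompt and input below are placeholders):

```ts
import { ChatMistralAI } from '@langchain/mistralai';

// Chat model replacing the MistralAI text completion model
// (a sketch of the approach, not the exact extension code).
const model = new ChatMistralAI({
  model: 'codestral-latest',
  apiKey: process.env.MISTRAL_API_KEY
});

// With a chat model, the completion request becomes a message exchange:
// a system prompt carrying the completion instructions, plus the code
// before the cursor as the human message.
const response = await model.invoke([
  ['system', 'Complete the user code. Return only the new code, without code block delimiters.'],
  ['human', 'df = pd.DataFrame(']
]);
console.log(response.content);
```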