Merlinite model needs "chat_format" set to "openchat" #825

MichaelClifford · 2024-04-05T15:32:21Z

Some models in our list have a different chat format. The environment variable CHAT_FORMAT needs to be set when starting the playground or model server image with these models. Without this parameter set correctly the models will not return valid responses.

https://github.com/containers/ai-lab-recipes/blob/38f66d8fa8b62ba63bb469083f0c32acf1793955/model_servers/llamacpp_python/src/run.sh#L8C5-L8C213

The affected models are:

Merlinite
Cerebrum
dragon-mistral

All three requires CHAT_FORMAT="openchat"

axel7083 · 2024-04-10T12:59:54Z

The image we are using ghcr.io/containers/podman-desktop-extension-ai-lab-playground-images/ai-lab-playground-chat:0.2.0 is not up to date.

$: podman run --entrypoint=/bin/bash -it ghcr.io/containers/podman-desktop-extension-ai-lab-playground-images/ai-lab-playground-chat:0.2.0 -c "cat /home/default/run.sh"
#!/bin/bash
#
# Copyright (C) 2024 Red Hat, Inc.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
#
# SPDX-License-Identifier: Apache-2.0
python -m llama_cpp.server --model ${MODEL_PATH} --host ${HOST:=0.0.0.0} --port ${PORT:=8000} --n_gpu_layers 0

axel7083 self-assigned this Apr 10, 2024

axel7083 added kind/enhancement ✨ New feature or request area/inference labels Apr 10, 2024

axel7083 mentioned this issue Apr 10, 2024

chore: update run.sh script containers/podman-desktop-extension-ai-lab-playground-images#11

Merged

axel7083 added this to Podman Desktop Planning Apr 10, 2024

axel7083 moved this to 🚧 In Progress in Podman Desktop Planning Apr 10, 2024

axel7083 mentioned this issue Apr 10, 2024

fix: adding chatformat to use for inference servers #868

Merged

1 task

slemeur added this to the 1.0 milestone Apr 11, 2024

axel7083 moved this from 🚧 In Progress to 🚥 In Review in Podman Desktop Planning Apr 11, 2024

axel7083 moved this from 🚥 In Review to 🚧 In Progress in Podman Desktop Planning Apr 11, 2024

axel7083 linked a pull request Apr 11, 2024 that will close this issue

chore: update run.sh script containers/podman-desktop-extension-ai-lab-playground-images#11

Merged

axel7083 closed this as completed in containers/podman-desktop-extension-ai-lab-playground-images#11 Apr 11, 2024

github-project-automation bot moved this from 🚧 In Progress to ✔️ Done in Podman Desktop Planning Apr 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merlinite model needs "chat_format" set to "openchat" #825

Merlinite model needs "chat_format" set to "openchat" #825

MichaelClifford commented Apr 5, 2024 •

edited

Loading

axel7083 commented Apr 10, 2024

Merlinite model needs "chat_format" set to "openchat" #825

Merlinite model needs "chat_format" set to "openchat" #825

Comments

MichaelClifford commented Apr 5, 2024 • edited Loading

axel7083 commented Apr 10, 2024

MichaelClifford commented Apr 5, 2024 •

edited

Loading