add support for Orion-14B #5118

sharpHL · 2024-01-24T18:10:16Z

support for the Orion-14B related models
https://huggingface.co/OrionStarAI/Orion-14B-Chat
https://huggingface.co/OrionStarAI/Orion-14B-Chat-Plugin
https://huggingface.co/OrionStarAI/Orion-14B-Chat-RAG

…B-Chat)

arch-btw · 2024-01-25T10:59:53Z

Can confirm that it works with https://huggingface.co/OrionStarAI/Orion-14B-Chat/blob/main/Orion-14B-Chat.gguf (converted to Q5_K_M).

Although, it is not clear what the correct prompt format is, -i -ins seems to work.

sorasoras · 2024-01-25T17:43:31Z

Can confirm working on rocm

Tangweirui2021

These changes do can fix the convert problem. And it also enables the model to run correctly.

llama.cpp

sharpHL

Orion-14B-support

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

llama.cpp

Co-authored-by: slaren <slarengh@gmail.com>

llama.cpp

zyxcambridge · 2024-01-29T13:14:29Z

llm_load_print_meta: BOS token = 1 ''
llm_load_print_meta: EOS token = 2 ''
llm_load_print_meta: UNK token = 0 ''
llm_load_print_meta: PAD token = 0 ''
llm_load_print_meta: LF token = 64 '<0x0A>'
llm_load_tensors: ggml ctx size = 0.34 MiB
ggml_backend_metal_buffer_from_ptr: error: failed to allocate buffer, size = 0.00 MiB
llama_model_load: error loading model: failed to allocate buffer
llama_load_model_from_file: failed to load model
llama_init_from_gpt_params: error: failed to load model 'Orion-14B-Chat.gguf'
main: error: unable to load model
(base) zhangyixin@zhangyixin llama.cpp %

* add support for Orion-14B(https://huggingface.co/OrionStarAI/Orion-14B-Chat) * flake8 support * Update llama.cpp Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> * Update llama.cpp Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> * Update llama.cpp Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> * Update llama.cpp Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> * Update llama.cpp Co-authored-by: slaren <slarengh@gmail.com> * Update llama.cpp * Update llama.cpp --------- Co-authored-by: lixiaopu <lixiaopu@cmcm.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> Co-authored-by: slaren <slarengh@gmail.com>

lixiaopu and others added 3 commits January 25, 2024 01:58

add support for Orion-14B(https://huggingface.co/OrionStarAI/Orion-14…

d64bb81

…B-Chat)

flake8 support

154319c

Merge branch 'ggerganov:master' into Orion-14B-support

0bd6d42

LostRuins mentioned this pull request Jan 26, 2024

(COMPATIBILITY) [v1.54 Smooth Sampling] - unknown model architecture: 'orion' LostRuins/koboldcpp#638

Closed

Tangweirui2021 approved these changes Jan 26, 2024

View reviewed changes

ggerganov approved these changes Jan 26, 2024

View reviewed changes

llama.cpp Outdated Show resolved Hide resolved

llama.cpp Outdated Show resolved Hide resolved

llama.cpp Outdated Show resolved Hide resolved

llama.cpp Outdated Show resolved Hide resolved

llama.cpp Outdated Show resolved Hide resolved

sharpHL commented Jan 27, 2024

View reviewed changes

sharpHL and others added 5 commits January 27, 2024 19:19

Update llama.cpp

0185aa7

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

Update llama.cpp

db44ddf

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

Update llama.cpp

aac36f9

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

Merge branch 'ggerganov:master' into Orion-14B-support

220f917

Update llama.cpp

82f5d56

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

slaren reviewed Jan 27, 2024

View reviewed changes

llama.cpp Outdated Show resolved Hide resolved

sharpHL and others added 2 commits January 28, 2024 00:57

Update llama.cpp

97fbb22

Co-authored-by: slaren <slarengh@gmail.com>

Merge branch 'ggerganov:master' into Orion-14B-support

40f5570

ggerganov reviewed Jan 28, 2024

View reviewed changes

llama.cpp Outdated Show resolved Hide resolved

Update llama.cpp

5918c98

ggerganov reviewed Jan 28, 2024

View reviewed changes

llama.cpp Outdated Show resolved Hide resolved

Update llama.cpp

f514e67

ggerganov merged commit f2e69d2 into ggerganov:master Jan 28, 2024
42 of 47 checks passed

prusnak mentioned this pull request Jan 28, 2024

convert-hf-to-gguf.py Qwen-72B-Chat model get Killed result #5156

Closed

ggerganov added a commit that referenced this pull request Jan 31, 2024

llama : reorder build_orion() at correct place (#5118)

d3bac7d

jordankanter pushed a commit to jordankanter/llama.cpp that referenced this pull request Feb 3, 2024

llama : reorder build_orion() at correct place (ggerganov#5118)

3196e58

hodlen pushed a commit to hodlen/llama.cpp that referenced this pull request Apr 1, 2024

llama : reorder build_orion() at correct place (ggerganov#5118)

bc5d042

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add support for Orion-14B #5118

add support for Orion-14B #5118

sharpHL commented Jan 24, 2024

arch-btw commented Jan 25, 2024

sorasoras commented Jan 25, 2024

Tangweirui2021 left a comment

sharpHL left a comment

zyxcambridge commented Jan 29, 2024

add support for Orion-14B #5118

add support for Orion-14B #5118

Conversation

sharpHL commented Jan 24, 2024

arch-btw commented Jan 25, 2024

sorasoras commented Jan 25, 2024

Tangweirui2021 left a comment

Choose a reason for hiding this comment

sharpHL left a comment

Choose a reason for hiding this comment

zyxcambridge commented Jan 29, 2024