llamamodel: add DeepSeek-V2 to whitelist #2702

Merged
merged 1 commit into main on Jul 22, 2024

Conversation

cebtenzzre (Member) commented Jul 19, 2024

This model actually works fine. I tried it with llama-cli before, but was thrown off by its huge default context size. With GPT4All's context size of 2K, it works fine with CUDA. It doesn't work with Kompute, due to that backend's lack of MoE support.
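For readers without the one-commit diff handy, here is a minimal sketch of what a whitelist addition like this looks like. The list and helper names are hypothetical (the actual identifiers in llamamodel.cpp may differ); "deepseek2" is the GGUF architecture name llama.cpp uses for DeepSeek-V2.

#include <algorithm>
#include <string>
#include <vector>

// Sketch: architectures GPT4All agrees to load. List and helper names are
// hypothetical; the architecture strings follow llama.cpp's GGUF names.
static const std::vector<const char *> KNOWN_ARCHES {
    "gptneox", "gemma2", "openelm", "chatglm", "jais",
    "deepseek2", // this PR: DeepSeek-V2 (works on CUDA; Kompute lacks MoE support)
};

static bool isArchSupported(const std::string &arch) {
    return std::find(KNOWN_ARCHES.begin(), KNOWN_ARCHES.end(), arch)
        != KNOWN_ARCHES.end();
}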


The changelog is not merged yet, but this patch should be applied:

diff --git a/gpt4all-chat/CHANGELOG.md b/gpt4all-chat/CHANGELOG.md
index b56b993c..bc75cb6c 100644
--- a/gpt4all-chat/CHANGELOG.md
+++ b/gpt4all-chat/CHANGELOG.md
@@ -28,8 +28,9 @@ The format is based on [Keep a Changelog](https://keepachangelog.com/en/1.1.0/).
 - Support translation of settings choices ([#2667](https://github.com/nomic-ai/gpt4all/pull/2667), [#2690](https://github.com/nomic-ai/gpt4all/pull/2690))
 - Improve LocalDocs view's error message (by @cosmic-snow in [#2679](https://github.com/nomic-ai/gpt4all/pull/2679))
 - Ignore case of LocalDocs file extensions ([#2642](https://github.com/nomic-ai/gpt4all/pull/2642), [#2684](https://github.com/nomic-ai/gpt4all/pull/2684))
-- Update llama.cpp to commit 87e397d00 from July 19th ([#2694](https://github.com/nomic-ai/gpt4all/pull/2694))
+- Update llama.cpp to commit 87e397d00 from July 19th ([#2694](https://github.com/nomic-ai/gpt4all/pull/2694), [#2702](https://github.com/nomic-ai/gpt4all/pull/2702))
   - Add support for GPT-NeoX, Gemma 2, OpenELM, ChatGLM, and Jais architectures (all with Vulkan support)
+  - Add support for DeepSeek-V2 architecture (no Vulkan support)
   - Enable Vulkan support for StarCoder2, XVERSE, Command R, and OLMo
 - Show scrollbar in chat collections list as needed (by [@cosmic-snow](https://github.com/cosmic-snow) in [#2691](https://github.com/nomic-ai/gpt4all/pull/2691))
 

Signed-off-by: Jared Van Bortel <jared@nomic.ai>
cebtenzzre requested a review from manyoso on July 19, 2024 at 23:12.
manyoso merged commit 4ca1d04 into main on Jul 22, 2024; 6 of 20 checks passed.
cosmic-snow (Collaborator)

I'm assuming this includes deepseek-coder-v2? Because there's an open issue for that: #2527

cebtenzzre (Member, Author)

> I'm assuming this includes deepseek-coder-v2? Because there's an open issue for that: #2527

This change is required for DeepSeek-Coder-V2-Instruct support, but I haven't tested that model yet - only DeepSeek-V2-Lite-Chat. I believe the different DeepSeek models use different pretokenizer configurations which have to be individually supported by llama.cpp.
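For context on that last point: llama.cpp reads a tokenizer.ggml.pre string from the GGUF metadata and maps each known value to an internal pretokenizer type, rejecting models whose value it does not recognize. A simplified sketch of that dispatch, with enum and string values taken from llama.cpp circa mid-2024 (treat the exact spellings as an assumption):

#include <stdexcept>
#include <string>

// Simplified from llama.cpp's vocab loading: the GGUF key
// "tokenizer.ggml.pre" names the regex-based pre-tokenization scheme;
// unknown names are rejected rather than guessed.
enum llama_vocab_pre_type {
    LLAMA_VOCAB_PRE_TYPE_DEFAULT,
    LLAMA_VOCAB_PRE_TYPE_DEEPSEEK_LLM,   // DeepSeek-V2(-Lite) chat models
    LLAMA_VOCAB_PRE_TYPE_DEEPSEEK_CODER, // DeepSeek-Coder-V2 models
    // ... one entry per supported tokenizer family
};

static llama_vocab_pre_type pre_type_from_name(const std::string &name) {
    if (name == "deepseek-llm")   return LLAMA_VOCAB_PRE_TYPE_DEEPSEEK_LLM;
    if (name == "deepseek-coder") return LLAMA_VOCAB_PRE_TYPE_DEEPSEEK_CODER;
    throw std::runtime_error("unknown pre-tokenizer type: " + name);
}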
