
llamamodel: fix BERT tokenization after llama.cpp update #2381

Merged · 2 commits into main · May 28, 2024

Conversation

cebtenzzre
Member

We have been stripping an extra token from the end of the input to BERT models since the llama.cpp update in #2310. This was caught by an assertion, but only in debug builds:

chat: .../llamamodel.cpp:923: LLamaModel::embedInternal(...)::<lambda(...)>: Assertion `useEOS == (eos_token != -1 && tokens[n_tokens - 1] == eos_token)' failed.
[1]    17958 IOT instruction (core dumped)  build/bin/chat

I had changed llama_tokenize upstream to use the same logic as BOS/CLS for the EOS/SEP token appended by the BERT tokenizer, but this code was still relying on the old behavior where EOS was appended unconditionally.
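As a rough illustration only, here is a minimal C++ sketch of the kind of conditional strip this implies. The helper name `stripAppendedEOS` and its exact placement are assumptions, not the actual patch; only `tokens`, `eos_token`, and `useEOS` come from the assertion quoted above.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

using llama_token = std::int32_t;

// Hypothetical sketch: strip a trailing EOS/SEP only if the tokenizer actually
// appended one, instead of assuming it is always present (the old behavior).
static void stripAppendedEOS(std::vector<llama_token> &tokens, llama_token eos_token, bool useEOS)
{
    bool hasEOS = eos_token != -1 && !tokens.empty() && tokens.back() == eos_token;

    // Debug-build sanity check mirroring the assertion quoted above: what the
    // caller expects must match what llama_tokenize actually produced.
    assert(useEOS == hasEOS);

    // Drop only a genuinely appended EOS/SEP; an unconditional pop_back()
    // would eat a real content token when none was appended.
    if (hasEOS)
        tokens.pop_back();
}
```

The key difference from the pre-update code is that nothing is removed when the tokenizer did not append an EOS/SEP, so content tokens are no longer lost.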

llama_tokenize now appends EOS and BOS consistently, so the logic that
strips EOS needs to change.

Signed-off-by: Jared Van Bortel <jared@nomic.ai>
@cebtenzzre requested a review from manyoso on May 28, 2024 at 16:15
Signed-off-by: Jared Van Bortel <jared@nomic.ai>
@cebtenzzre merged commit f1b4092 into main on May 28, 2024
6 of 19 checks passed