Fixed n vocab #9511

Xarbirus · 2024-09-16T16:38:01Z

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

For no_vocab models, the current llm_load_vocab function incorrectly fills n_vocab, which is why an

llama_decode_internal: invalid token[0] = ...

error occurs during inference.

…code_internal`

src/llama.cpp

ggerganov

Btw, we currently have state duplication in the sense that hparams.n_vocab and vocab.n_vocab are the same thing but initialized independently in different ways. The reason to have the latter is because some parts of the implementation are decoupled from the model/hparams.

Just noting this here for awareness - no action is necessary for this PR. Will be resolved in future refactorings.

src/llama.cpp

Co-authored-by: Georgi Gerganov <[email protected]>

* llama: fixed n_vocab for `no_vocab` models * llama: updated error output for `llama_decode_internal` and `llama_encode_internal` * llama: log warning if there's no vocab_size in metadata * llama: correct vocab size for logging Co-authored-by: Georgi Gerganov <[email protected]> --------- Co-authored-by: Georgi Gerganov <[email protected]>

Xarbirus added 2 commits September 16, 2024 18:30

llama: fixed n_vocab for no_vocab models

a5e87bf

llama: updated error output for llama_decode_internal and `llama_en…

544b266

…code_internal`

ggerganov reviewed Sep 16, 2024

View reviewed changes

src/llama.cpp Outdated Show resolved Hide resolved

llama: log warning if there's no vocab_size in metadata

9704f0e

ggerganov reviewed Sep 17, 2024

View reviewed changes

src/llama.cpp Outdated Show resolved Hide resolved

llama: correct vocab size for logging

93ef595

Co-authored-by: Georgi Gerganov <[email protected]>

ggerganov approved these changes Sep 17, 2024

View reviewed changes

ggerganov merged commit 8344ef5 into ggerganov:master Sep 17, 2024
50 of 51 checks passed

Xarbirus deleted the fixed_n_vocab branch September 23, 2024 16:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixed n vocab #9511

Fixed n vocab #9511

Xarbirus commented Sep 16, 2024

ggerganov left a comment

Fixed n vocab #9511

Fixed n vocab #9511

Conversation

Xarbirus commented Sep 16, 2024

ggerganov left a comment

Choose a reason for hiding this comment