Fix crash when using grammar #1649

tc-wolf · 2024-08-01T20:56:47Z

Upstream changed some things w/ how grammar works in ggerganov/llama.cpp#8508 and ggerganov/llama.cpp#8093 - may also want to check if llama_grammar_init return value is null, since this is what's done now rather than throwing an error.

Old argument order for llama_grammar_accept_token was:
llama_grammar_accept_token(ctx, grammar, token)

Now this is:
llama_grammar_accept_token(grammar, ctx, token)

Can test with

model = llama_cpp.Llama("bartowski/Meta-Llama-3.1-8B-Instruct-GGUF/Meta-Llama-3.1-8B-Instruct-Q8_0.gguf", n_ctx=4096, n_gpu_layers=-1, offload_kqv=True, n_batch=1024, n_threads=12, n_threads_batch=12, verbose=True)
model.create_chat_completion(
    messages=[
        {
            "role": "system",
            "content": "You are a helpful assistant that outputs in JSON.",
        },
        {"role": "user", "content": "Who won the world series in 2020"},
    ],
    response_format={
        "type": "json_object",
        "schema": {
            "type": "object",
            "properties": {"team_name": {"type": "string"}},
            "required": ["team_name"],
        },
    },
    temperature=0.7,
)

Should fix #1623

Old was: llama_grammar_accept_token(ctx, grammar, token) Now this is: llama_grammar_accept_token(grammar, ctx, token)

abetlen · 2024-08-04T21:07:47Z

@tc-wolf thank you so much!

fix llama_grammar_accept_token arg order

a928c62

Old was: llama_grammar_accept_token(ctx, grammar, token) Now this is: llama_grammar_accept_token(grammar, ctx, token)

abetlen merged commit 5575fed into abetlen:main Aug 4, 2024
13 checks passed

abetlen mentioned this pull request Aug 4, 2024

Ported back new grammar changes from C++ to Python implementation #1637

Merged

CISC mentioned this pull request Aug 5, 2024

Chat completions crashes when asked for JSON response #1655

Closed

4 tasks

This was referenced Aug 6, 2024

The latest version kills python kernel with LlamaGrammar #1623

Closed

create_chat_completion is stuck in versions 0.2.84 and 0.2.85 for Mac Silicon #1648

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix crash when using grammar #1649

Fix crash when using grammar #1649

tc-wolf commented Aug 1, 2024 •

edited

Loading

abetlen commented Aug 4, 2024

Fix crash when using grammar #1649

Fix crash when using grammar #1649

Conversation

tc-wolf commented Aug 1, 2024 • edited Loading

abetlen commented Aug 4, 2024

tc-wolf commented Aug 1, 2024 •

edited

Loading