
tool-call: fix type promotion typo causing crashes w/ --jinja w/o tools #11880

Merged
merged 1 commit into from
Feb 15, 2025

Conversation

ochafik
Collaborator

@ochafik ochafik commented Feb 15, 2025

Fixes (at least part of) #11866

Somehow a bool can be assigned to an std::string 🤯: new fear unlocked 😅

This introduced varying degrees of soft or hard crashes, as the grammar string got a true (byte 1) followed by... something random / platform-specific.

Different compilers seem to let this through to different degrees:

bug.cc

// g++ -Wall -Wextra -Wpedantic bug.cc -o out
#include <string>
int main() {
  std::string s;
  // s = false; // this is caught by Apple clang version 15.0.0 but not GCC 14
  s = s.empty(); // caught by neither, even w/ -Wall -Wextra -Wpedantic 
}

@ochafik ochafik changed the title tool-call: fix no-tools, jinja case tool-call: fix --jinja w/o tools case Feb 15, 2025
@ochafik ochafik marked this pull request as ready for review February 15, 2025 01:01
@ochafik ochafik requested a review from ggerganov February 15, 2025 01:05
@ochafik ochafik changed the title tool-call: fix --jinja w/o tools case tool-call: fix --jinja w/o tools (type promotion typo caused crashes) Feb 15, 2025
@henryclw

Now the fear is unlocked for me as well. I can't assume my compiler will catch all the type errors.

@MoonRide303
Contributor

@ochafik eafd957 helped for Gemma 2 (the parse: error parsing grammar: expecting name at message is gone), but both Llama 3.2 and Qwen2.5 now fail to start when trying to load the original jinja template from a file. There are no errors; both just silently quit after printing the device info:

llama-server.exe -ngl 99 -m Qwen2.5-1.5B-Instruct-Q8_0.gguf --jinja --chat-template-file qwen2.5.jinja -c 8192
ggml_cuda_init: GGML_CUDA_FORCE_MMQ:    no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 1 CUDA devices:
  Device 0: NVIDIA GeForce RTX 4080, compute capability 8.9, VMM: yes

In the case of Gemma I also noticed a "srv params_from_: Chat format: Content-only" message in the server log, which wasn't printed when I didn't use a jinja template.

@ggerganov
Member

clang-tidy detects these kind of errors:

[screenshot: clang-tidy warning flagging the bool-to-string assignment]

Overall the chat.cpp could use some tidy-ing, as it currently "lights up" in my editor and it's difficult to filter the relevant warnings from the less-relevant:

[screenshot: editor showing numerous clang-tidy warnings in chat.cpp]

@ochafik ochafik changed the title tool-call: fix --jinja w/o tools (type promotion typo caused crashes) tool-call: fix type promotion typo causing crashes w/ --jinja w/o tools Feb 15, 2025
@ochafik
Collaborator Author

ochafik commented Feb 15, 2025

clang-tidy detects these kind of errors:
...
Overall the chat.cpp could use some tidy-ing, as it currently "lights up" in my editor and it's difficult to filter the relevant warnings from the less-relevant:
...

I'll do a pass separately / see whether we can run checks in CI. In some of the cases above, clang-tidy and -Wunused-parameter disagree on whether an unused parameter should have a name (I'll try to use // NOLINT sparingly :-))

@ochafik
Collaborator Author

ochafik commented Feb 15, 2025

@ochafik eafd957 helped for Gemma 2 (the parse: error parsing grammar: expecting name at message is gone), but both Llama 3.2 and Qwen2.5 now fail to start when trying to load the original jinja template from a file. There are no errors; both just silently quit after printing the device info:

llama-server.exe -ngl 99 -m Qwen2.5-1.5B-Instruct-Q8_0.gguf --jinja --chat-template-file qwen2.5.jinja -c 8192
ggml_cuda_init: GGML_CUDA_FORCE_MMQ:    no
ggml_cuda_init: GGML_CUDA_FORCE_CUBLAS: no
ggml_cuda_init: found 1 CUDA devices:
  Device 0: NVIDIA GeForce RTX 4080, compute capability 8.9, VMM: yes

@henryclw Thanks for checking! I can't repro on Mac, will spin up a Windows VM & track this on #11866

In case of Gemma I also noticed "srv params_from_: Chat format: Content-only" message in the server log, which wasn't printed when I didn't use jinja template.

This is expected 👍

@ochafik ochafik merged commit f355229 into ggml-org:master Feb 15, 2025
46 checks passed