ggml : dynamic ggml_sched_max_splits based on graph_size #9047

nicoboss · 2024-08-15T19:13:18Z

This fixes #9044

Sets ggml_sched_max_splits to be equal to graph_size as recommended by @slaren in #9044 (comment) since at most there is one split for each node in the graph.

Thanks to this change I was able to run GPU accelerated inference on BigLlama-3.1-681B-Instruct which prior to this change caused llama.cpp to crash.

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

ggml/src/ggml-backend.c

) * ggml : Dynamic ggml_sched_max_splits based on graph_size * Fixed and readded debug code for causes

ggml : Dynamic ggml_sched_max_splits based on graph_size

d86f10a

github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Aug 15, 2024

nicoboss mentioned this pull request Aug 15, 2024

Bug: GGML_SCHED_MAX_SPLITS must be increased to run BigLlama-3.1-681B-Instruct using GPU acceleration #9044

Closed

slaren reviewed Aug 16, 2024

View reviewed changes

ggml/src/ggml-backend.c Outdated Show resolved Hide resolved

Fixed and readded debug code for causes

fe40950

slaren approved these changes Aug 16, 2024

View reviewed changes

slaren merged commit e3f6fd5 into ggerganov:master Aug 16, 2024
51 checks passed

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024

ggml : dynamic ggml_sched_max_splits based on graph_size (ggerganov#9047

7c7e814

) * ggml : Dynamic ggml_sched_max_splits based on graph_size * Fixed and readded debug code for causes

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024

ggml : dynamic ggml_sched_max_splits based on graph_size (ggerganov#9047

402efe1

) * ggml : Dynamic ggml_sched_max_splits based on graph_size * Fixed and readded debug code for causes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ggml : dynamic ggml_sched_max_splits based on graph_size #9047

ggml : dynamic ggml_sched_max_splits based on graph_size #9047

nicoboss commented Aug 15, 2024

ggml : dynamic ggml_sched_max_splits based on graph_size #9047

ggml : dynamic ggml_sched_max_splits based on graph_size #9047

Conversation

nicoboss commented Aug 15, 2024