Skip to content

Commit

Permalink
feat: increase limit to capable Llama 7B
Browse files Browse the repository at this point in the history
  • Loading branch information
hans00 committed May 3, 2024
1 parent a044dbc commit 6058861
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion src/ggml-qnn/ggml-qnn.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -4282,7 +4282,8 @@ static size_t ggml_backend_qnn_buffer_type_get_max_size(ggml_backend_buffer_type
GGML_UNUSED(buft);
//works fine with ggml-tiny.en-q8_0.bin for whisper.cpp
//return (38 * 1024 * 1024);
return (96 * 1024 * 1024);
// return (96 * 1024 * 1024);
return (128 * 1024 * 1024);
}


Expand Down

0 comments on commit 6058861

Please sign in to comment.