b3579 #290

Nexesenex · 2024-08-12T20:25:23Z

No description provided.

* gguf-py : Numpy dequantization for most types * gguf-py : Numpy dequantization for grid-based i-quants

ggml-ci

* py : fix requirements check '==' -> '~=' * cont : fix the fix * ci : run on all requirements.txt

Fixes: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=70724 In order to access the above bug you need to login using one of the emails in https://github.com/google/oss-fuzz/blob/master/projects/llamacpp/project.yaml#L3-L5 Signed-off-by: David Korczynski <[email protected]>

Fixes: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=70680 Signed-off-by: David Korczynski <[email protected]>

* readme: introduce gpustack GPUStack is an open-source GPU cluster manager for running large language models, which uses llama.cpp as the backend. Signed-off-by: thxCode <[email protected]> * readme: introduce gguf-parser GGUF Parser is a tool to review/check the GGUF file and estimate the memory usage without downloading the whole model. Signed-off-by: thxCode <[email protected]> --------- Signed-off-by: thxCode <[email protected]>

* llama : model-based max number of graph nodes calculation * Update src/llama.cpp --------- Co-authored-by: slaren <[email protected]>

ref: #8912

Signed-off-by: Diogo Teles Sant'Anna <[email protected]>

compilade and others added 10 commits August 11, 2024 14:45

gguf-py : Numpy dequantization for most types (#8939)

4134999

* gguf-py : Numpy dequantization for most types * gguf-py : Numpy dequantization for grid-based i-quants

server : handle models with missing EOS token (#8997)

5ef07e2

ggml-ci

py : fix requirements check '==' -> '~=' (#8982)

d3ae0ee

* py : fix requirements check '==' -> '~=' * cont : fix the fix * ci : run on all requirements.txt

Fix a spelling mistake (#9001)

2589292

grammar-parser : fix possible null-deref (#9004)

1262e7e

Fixes: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=70680 Signed-off-by: David Korczynski <[email protected]>

llama : model-based max number of graph nodes calculation (#8970)

0fd93cd

* llama : model-based max number of graph nodes calculation * Update src/llama.cpp --------- Co-authored-by: slaren <[email protected]>

ci : enable RPC in all of the released builds (#9006)

1f67436

ref: #8912

ci : fix github workflow vulnerable to script injection (#9008)

fc4ca27

Signed-off-by: Diogo Teles Sant'Anna <[email protected]>

github-actions bot added examples python server ggml devops labels Aug 12, 2024

Nexesenex merged commit 8408090 into Nexesenex:spacestream Aug 12, 2024
23 of 31 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

b3579 #290

b3579 #290

Nexesenex commented Aug 12, 2024

b3579 #290

b3579 #290

Conversation

Nexesenex commented Aug 12, 2024