Releases: sorasoras/llama.cpp

b3922

15 Oct 19:17
755a9b2
llama : add infill sampler (#9896)

ggml-ci

b3184

19 Jun 16:09
9c77ec1
ggml : synchronize threads using barriers (#7993)

b2938

19 May 16:12
f030ec1
Vulkan Embedding Fix (#7360)

* Fix empty Vulkan host buffers

Add fp32 fp16 matmul shader

Fix matmul shader alignment

* Remove deprecated tensor->backend uses

* Fix Vulkan validation errors on embedding models with no offloaded layers

* Fix Vulkan llava segfault when not offloading layers

b2898

15 May 15:48
7257490
Merge pull request #15 from ggerganov/master

push update

b2878

14 May 14:20
1265c67
Revert "move ndk code to a new library (#6951)" (#7282)

This reverts commit efc8f767c8c8c749a245dd96ad4e2f37c164b54c.

b2872

13 May 06:44
00f5061
Merge pull request #12 from JohannesGaessler/server-ngram-4

Server ngram 4

b2866

12 May 18:48
def1fe0
Merge pull request #10 from JohannesGaessler/cuda-fa-no-tc-11

Cuda fa no tc 11

b2861

12 May 18:48
6f1b636
cmake : fix version cmp (#7227)

b2843

11 May 04:55
e849648
llama-bench : add pp+tg test type (#7199)

b2633

08 Apr 16:50
cecd8d3
Comment explaining a decision (#6531)