Releases · sorasoras/llama.cpp

15 Oct 19:17

755a9b2

b3922 Latest

llama : add infill sampler (#9896)

ggml-ci

Assets 22

cudart-llama-bin-win-cu11.7.1-x64.zip  293 MB  2024-10-15T19:17:36Z
cudart-llama-bin-win-cu12.2.0-x64.zip  413 MB  2024-10-15T19:17:44Z
llama-b1-bin-win-hip-x64-gfx1030.zip  236 MB  2024-10-15T19:17:59Z
llama-b1-bin-win-hip-x64-gfx1100.zip  237 MB  2024-10-15T19:18:07Z
llama-b1-bin-win-hip-x64-gfx1101.zip  237 MB  2024-10-15T19:18:16Z
llama-b3922-bin-macos-arm64.zip  52.1 MB  2024-10-15T19:18:25Z
llama-b3922-bin-macos-x64.zip  52.9 MB  2024-10-15T19:18:27Z
llama-b3922-bin-ubuntu-x64.zip  58.6 MB  2024-10-15T19:18:29Z
llama-b3922-bin-win-avx-x64.zip  7.81 MB  2024-10-15T19:18:32Z
llama-b3922-bin-win-avx2-x64.zip  7.8 MB  2024-10-15T19:18:32Z
Source code (zip)  2024-10-15T13:35:33Z
Source code (tar.gz)  2024-10-15T13:35:33Z

19 Jun 16:09

github-actions

b3184

9c77ec1

ggml : synchronize threads using barriers (#7993)

Assets 20

19 May 16:12

github-actions

b2938

f030ec1

Vulkan Embedding Fix (#7360)

* Fix empty Vulkan host buffers

Add fp32 fp16 matmul shader

Fix matmul shader alignment

* Remove deprecated tensor->backend uses

* Fix Vulkan validation errors on embedding models with no offloaded layers

* Fix Vulkan llava segfault when not offloading layers

Assets 2

15 May 15:48

github-actions

b2898

7257490

Merge pull request #15 from ggerganov/master

push update

Assets 20

14 May 14:20

github-actions

b2878

1265c67

Revert "move ndk code to a new library (#6951)" (#7282)

This reverts commit efc8f767c8c8c749a245dd96ad4e2f37c164b54c.

Assets 20

13 May 06:44

github-actions

b2872

00f5061

Merge pull request #12 from JohannesGaessler/server-ngram-4

Server ngram 4

Assets 19

12 May 18:48

github-actions

b2866

def1fe0

Merge pull request #10 from JohannesGaessler/cuda-fa-no-tc-11

Cuda fa no tc 11

Assets 19

12 May 18:48

github-actions

b2861

6f1b636

cmake : fix version cmp (#7227)

Assets 19

11 May 04:55

github-actions

b2843

e849648

llama-bench : add pp+tg test type (#7199)

Assets 19

08 Apr 16:50

github-actions

b2633

cecd8d3

Comment explaining a decision (#6531)

Assets 2
