
Releases: ggerganov/llama.cpp

b4273 (05 Dec 19:59, commit c9c6e01)
vulkan: Add VK_NV_cooperative_matrix2 support for mul_mat and flash a…

b4272 (05 Dec 19:25, commit 6fe6247)
llama : add Minerva 7B model support (#10673)

* Support for Minerva 7B

* Update convert_hf_to_gguf_update.py

b4271 (05 Dec 12:12, commit 0cd182e)
sync : ggml

b4267 (04 Dec 23:16, commit f112d19)
Update deprecation-warning.cpp (#10619)

Fixed Path Separator Handling for Cross-Platform Support (Windows File Systems)

b4266 (04 Dec 21:22, commit 1da7b76)
server : fix speculative decoding with context shift (#10641)

* server : fix speculative decoding with context shift

ggml-ci

* server : take into account speculative limits

ggml-ci

* server : add tests

b4265 (04 Dec 14:49, commit 59f4db1)
ggml : add predefined list of CPU backend variants to build (#10626)

* ggml : add predefined list of CPU backend variants to build

* update CPU dockerfiles

b4262 (04 Dec 10:25, commit 8d0cfd5)
llama: Support MiniCPM-1B (with & w/o longrope) (#10559)

b4261 (04 Dec 08:02, commit 2759916)
vulkan: Implement "fast divide" (mul+shift) for unary ops like copy (…

b4260 (04 Dec 02:15, commit 40c6d79)
SYCL : Move to compile time oneMKL interface backend selection for NV…

b4258 (04 Dec 01:45, commit cd2f37b)
Avoid using __fp16 on ARM with old nvcc (#10616)
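A workaround like the one above typically takes the shape of a preprocessor guard: only use the ARM `__fp16` type when it is known to be safe, and otherwise fall back to a plain 16-bit storage type. The sketch below shows that shape under assumptions of mine (the typedef name `fp16_storage_t` and the exact version condition are hypothetical, not quoted from the commit).

```c
#include <stdint.h>

/* Hypothetical sketch: gate __fp16 usage so that old nvcc (CUDA 11 or
 * earlier) compiling for ARM falls back to a raw 16-bit storage type. */
#if defined(__ARM_NEON) && !(defined(__CUDACC__) && __CUDACC_VER_MAJOR__ <= 11)
typedef __fp16   fp16_storage_t; /* native half-precision type */
#else
typedef uint16_t fp16_storage_t; /* bit pattern only; convert explicitly */
#endif
```

Either branch yields a 2-byte type, so structure layouts stay identical regardless of which compiler is in use.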