Skip to content

Actions: ggerganov/llama.cpp

Publish Docker image

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
8,442 workflow run results
8,442 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

server : handle models with missing EOS token (#8997)
Publish Docker image #14354: Commit 5ef07e2 pushed by ggerganov
August 12, 2024 07:21 3h 0m 56s master
August 12, 2024 07:21 3h 0m 56s
llama : check all graph nodes when searching for result_embd_pooled (…
Publish Docker image #14353: Commit 33309f6 pushed by fairydreaming
August 11, 2024 08:35 3h 24m 48s master
August 11, 2024 08:35 3h 24m 48s
Optimize Vulkan backend for better CPU performance and less GPU synch…
Publish Docker image #14352: Commit 7c5bfd5 pushed by 0cc4m
August 11, 2024 08:09 2h 29m 39s master
August 11, 2024 08:09 2h 29m 39s
metal : fix uninitialized abort_callback (#8968)
Publish Docker image #14351: Commit 6e02327 pushed by slaren
August 10, 2024 13:42 3h 12m 35s master
August 10, 2024 13:42 3h 12m 35s
llama : default n_swa for phi-3 (#8931)
Publish Docker image #14350: Commit 7eb2384 pushed by ngxson
August 10, 2024 11:04 2h 45m 1s master
August 10, 2024 11:04 2h 45m 1s
Add support for encoder-only T5 models (#8900)
Publish Docker image #14349: Commit 7c3f55c pushed by fairydreaming
August 10, 2024 09:43 2h 38m 37s master
August 10, 2024 09:43 2h 38m 37s
Merge commit from fork
Publish Docker image #14348: Commit b72942f pushed by ggerganov
August 9, 2024 20:03 2h 37m 43s master
August 9, 2024 20:03 2h 37m 43s
llama : add support for lora adapters in T5 model (#8938)
Publish Docker image #14347: Commit 6afd1a9 pushed by fairydreaming
August 9, 2024 16:53 2h 47m 20s master
August 9, 2024 16:53 2h 47m 20s
make : fix llava obj file race (#8946)
Publish Docker image #14346: Commit 272e3bd pushed by ggerganov
August 9, 2024 15:24 3h 32m 1s master
August 9, 2024 15:24 3h 32m 1s
llama : better replace_all (cont) (#8926)
Publish Docker image #14345: Commit 45a55b9 pushed by ggerganov
August 9, 2024 15:23 59m 32s master
August 9, 2024 15:23 59m 32s
llava : support MiniCPM-V-2.5 (#7599)
Publish Docker image #14344: Commit 3071c0a pushed by ggerganov
August 9, 2024 10:33 40m 24s master
August 9, 2024 10:33 40m 24s
sync : ggml
Publish Docker image #14343: Commit 4305b57 pushed by ggerganov
August 9, 2024 07:04 2h 46m 3s master
August 9, 2024 07:04 2h 46m 3s
server : add one level list nesting for embeddings (#8936)
Publish Docker image #14342: Commit daef3ab pushed by ggerganov
August 9, 2024 06:32 6m 1s master
August 9, 2024 06:32 6m 1s
llama : reduce useless copies when saving session (#8916)
Publish Docker image #14341: Commit 345a686 pushed by compilade
August 9, 2024 03:54 2h 34m 21s master
August 9, 2024 03:54 2h 34m 21s
sync : ggml
Publish Docker image #14340: Commit e44a561 pushed by ggerganov
August 8, 2024 10:20 2h 28m 33s master
August 8, 2024 10:20 2h 28m 33s
make : clean llamafile objects (#8923)
Publish Docker image #14339: Commit ebd541a pushed by ggerganov
August 8, 2024 08:44 2h 29m 35s master
August 8, 2024 08:44 2h 29m 35s
make : use C compiler to build metal embed object (#8899)
Publish Docker image #14338: Commit 15fa07a pushed by slaren
August 7, 2024 16:24 2h 38m 5s master
August 7, 2024 16:24 2h 38m 5s
ggml-backend : fix async copy from CPU (#8897)
Publish Docker image #14337: Commit be55695 pushed by slaren
August 7, 2024 11:29 2h 32m 6s master
August 7, 2024 11:29 2h 32m 6s
[SYCL] Updated SYCL device filtering (#8901)
Publish Docker image #14336: Commit 0478174 pushed by OuadiElfarouki
August 7, 2024 10:25 2h 31m 32s master
August 7, 2024 10:25 2h 31m 32s
CUDA/HIP: fix tests/test-backend-ops (#8896)
Publish Docker image #14335: Commit a8dbc6f pushed by JohannesGaessler
August 7, 2024 07:07 2h 42m 7s master
August 7, 2024 07:07 2h 42m 7s
llama-bench : add support for getting cpu info on Windows (#8824)
Publish Docker image #14334: Commit 506122d pushed by slaren
August 7, 2024 01:01 2h 55m 15s master
August 7, 2024 01:01 2h 55m 15s
quantize : update usage comment in quantize.cpp (#8889)
Publish Docker image #14333: Commit 725e3d9 pushed by slaren
August 6, 2024 23:43 3h 10m 47s master
August 6, 2024 23:43 3h 10m 47s
typo correction (#8891)
Publish Docker image #14332: Commit 3195854 pushed by slaren
August 6, 2024 23:41 2h 33m 13s master
August 6, 2024 23:41 2h 33m 13s
server : add lora hotswap endpoint (WIP) (#8857)
Publish Docker image #14331: Commit 1e6f655 pushed by ngxson
August 6, 2024 15:33 3h 22m 46s master
August 6, 2024 15:33 3h 22m 46s
CUDA: fix padding logic for FP16/FP32 (#8884)
Publish Docker image #14330: Commit 641f5dd pushed by JohannesGaessler
August 6, 2024 15:13 2h 44m 21s master
August 6, 2024 15:13 2h 44m 21s