Skip to content

Actions: ggerganov/llama.cpp

Publish Docker image

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
7,777 workflow run results
7,777 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

llama : use F32 precision in GLM4 attention and no FA (#9130)
Publish Docker image #14390: Commit a07c32e pushed by ggerganov
August 23, 2024 07:27 2h 54m 25s master
August 23, 2024 07:27 2h 54m 25s
[SYCL] Add a space to supress a cmake warning (#9133)
Publish Docker image #14389: Commit 11b84eb pushed by airMeng
August 22, 2024 14:09 2h 30m 29s master
August 22, 2024 14:09 2h 30m 29s
[SYCL] Add oneDNN primitive support (#9091)
Publish Docker image #14388: Commit 1731d42 pushed by airMeng
August 22, 2024 04:50 2h 31m 28s master
August 22, 2024 04:50 2h 31m 28s
llama : simplify Mamba with advanced batch splits (#8526)
Publish Docker image #14387: Commit a1631e5 pushed by compilade
August 21, 2024 21:58 2h 48m 6s master
August 21, 2024 21:58 2h 48m 6s
server : support reading arguments from environment variables (#9105)
Publish Docker image #14386: Commit fc54ef0 pushed by ngxson
August 21, 2024 09:04 5h 46m 37s master
August 21, 2024 09:04 5h 46m 37s
llama : support for falcon-mamba architecture (#9074)
Publish Docker image #14385: Commit b40eb84 pushed by ggerganov
August 21, 2024 08:06 4h 53m 43s master
August 21, 2024 08:06 4h 53m 43s
llava : zero-initialize clip_ctx structure fields with aggregate init…
Publish Docker image #14384: Commit f63f603 pushed by fairydreaming
August 21, 2024 07:45 3h 11m 52s master
August 21, 2024 07:45 3h 11m 52s
llama : std::move llm_bigram_bpe from work_queue (#9062)
Publish Docker image #14383: Commit 8455340 pushed by ggerganov
August 21, 2024 07:33 2h 49m 23s master
August 21, 2024 07:33 2h 49m 23s
llava: Add ACC OP for GPU acceleration to the Vulkan backend in the L…
Publish Docker image #14382: Commit 2f3c146 pushed by 0cc4m
August 20, 2024 19:00 2h 38m 15s master
August 20, 2024 19:00 2h 38m 15s
[SYCL] fallback mmvq (#9088)
Publish Docker image #14381: Commit 50addec pushed by NeoZhangJianyu
August 20, 2024 15:50 2h 32m 15s master
August 20, 2024 15:50 2h 32m 15s
[SYCL] Fix SYCL im2col and convert Overflow with Large Dims (#9052)
Publish Docker image #14380: Commit 4f8d19f pushed by NeoZhangJianyu
August 20, 2024 15:06 2h 39m 39s master
August 20, 2024 15:06 2h 39m 39s
tests : add missing comma in grammar integration tests (#9099)
Publish Docker image #14379: Commit 90db814 pushed by ggerganov
August 20, 2024 09:09 4h 10m 15s master
August 20, 2024 09:09 4h 10m 15s
cann: add doc for cann backend (#8867)
Publish Docker image #14378: Commit cfac111 pushed by hipudding
August 19, 2024 08:46 3h 14m 41s master
August 19, 2024 08:46 3h 14m 41s
rpc : print error message when failed to connect endpoint (#9042)
Publish Docker image #14377: Commit 1b6ff90 pushed by rgerganov
August 19, 2024 07:11 3h 14m 22s master
August 19, 2024 07:11 3h 14m 22s
rpc : prevent crashes on invalid input (#9040)
Publish Docker image #14376: Commit 18eaf29 pushed by rgerganov
August 19, 2024 07:10 2h 33m 58s master
August 19, 2024 07:10 2h 33m 58s
Fix incorrect use of ctx_split for bias tensors (#9063)
Publish Docker image #14375: Commit 2fb9267 pushed by slaren
August 17, 2024 13:34 2h 36m 54s master
August 17, 2024 13:34 2h 36m 54s
server : refactor middleware and /health endpoint (#9056)
Publish Docker image #14374: Commit 8b3befc pushed by ngxson
August 16, 2024 15:19 2h 30m 43s master
August 16, 2024 15:19 2h 30m 43s
llava : support MiniCPM-V-2.6 (#8967)
Publish Docker image #14373: Commit d565bb2 pushed by ggerganov
August 16, 2024 13:34 2h 43m 23s master
August 16, 2024 13:34 2h 43m 23s
llama : add EXAONE model support (#9025)
Publish Docker image #14372: Commit c679e0c pushed by ggerganov
August 16, 2024 06:35 3h 23m 28s master
August 16, 2024 06:35 3h 23m 28s
common : add support for cpu_get_num_physical_cores() on Windows (#8771)
Publish Docker image #14371: Commit fb487bb pushed by ggerganov
August 16, 2024 06:23 3h 0m 17s master
August 16, 2024 06:23 3h 0m 17s
Add Nemotron/Minitron GGUF Conversion & Inference Support (#8922)
Publish Docker image #14370: Commit 2a24c8c pushed by slaren
August 16, 2024 02:23 3h 16m 28s master
August 16, 2024 02:23 3h 16m 28s
ggml : dynamic ggml_sched_max_splits based on graph_size (#9047)
Publish Docker image #14369: Commit e3f6fd5 pushed by slaren
August 16, 2024 02:22 2h 51m 33s master
August 16, 2024 02:22 2h 51m 33s
retrieval : fix memory leak in retrieval query handling (#8955)
Publish Docker image #14368: Commit 4b9afbb pushed by ggerganov
August 15, 2024 07:40 4h 46m 54s master
August 15, 2024 07:40 4h 46m 54s
server : fix duplicated n_predict key in the generation_settings (#8994)
Publish Docker image #14367: Commit 37501d9 pushed by ggerganov
August 15, 2024 07:28 4h 4m 1s master
August 15, 2024 07:28 4h 4m 1s
common : remove duplicate function llama_should_add_bos_token (#8778)
Publish Docker image #14366: Commit 4af8420 pushed by ggerganov
August 15, 2024 07:23 3h 42m 11s master
August 15, 2024 07:23 3h 42m 11s