Actions: ggerganov/llama.cpp

Publish Docker image

7,172 workflow run results

json: fix additionalProperties, allow space after enum/const
Publish Docker image #14086: Pull request #7840 synchronize by ochafik
June 25, 2024 00:43 1h 57m 30s ochafik:json-additional
gfx1010 optimizations
Publish Docker image #14085: Pull request #8085 synchronize by daniandtheweb
June 25, 2024 00:15 2h 10m 42s daniandtheweb:gfx1010_optimizations
[SYCL] Re-enabled mul_mat_batched_sycl
Publish Docker image #14084: Pull request #8095 synchronize by airMeng
June 25, 2024 00:09 1h 20m 1s sycl-mul-mat-batched
CUDA: fix matrix multiplication algorithm choice (#8102)
Publish Docker image #14083: Commit 2df373a pushed by JohannesGaessler
June 24, 2024 23:22 2h 39m 18s master
json: support integer minimum, maximum, exclusiveMinimum, exclusiveMaximum
Publish Docker image #14082: Pull request #7797 synchronize by ochafik
June 24, 2024 22:44 1h 44m 17s ochafik:json-bounds2
json: fix additionalProperties, allow space after enum/const
Publish Docker image #14080: Pull request #7840 synchronize by ochafik
June 24, 2024 20:31 2h 59m 47s ochafik:json-additional
json: support integer minimum, maximum, exclusiveMinimum, exclusiveMaximum
Publish Docker image #14079: Pull request #7797 synchronize by ochafik
June 24, 2024 20:29 2h 15m 12s ochafik:json-bounds2
json: better support for "type" unions (e.g. nullable arrays w/ typed items)
Publish Docker image #14078: Pull request #7863 synchronize by ochafik
June 24, 2024 20:20 2h 29m 55s ochafik:json-type
CUDA: fix MMQ writeback for int8 tensor cores (#8100)
Publish Docker image #14077: Commit 3b099bc pushed by JohannesGaessler
June 24, 2024 20:15 3h 21m 18s master
Add Unigram tokenizer needed by T5 and FLAN-T5 model families
Publish Docker image #14076: Pull request #8089 synchronize by fairydreaming
June 24, 2024 19:39 2h 29m 35s fairydreaming:t5-clean-2
llama : reorganize source code + improve CMake
Publish Docker image #14075: Pull request #8006 synchronize by ggerganov
June 24, 2024 19:06 5s gg/reorganize-project
Detokenizer fixes
Publish Docker image #14073: Pull request #8039 synchronize by jaime-m-p
June 24, 2024 18:56 4s jaime-m-p:detokenizer
Streamline embeddings from "non-embedding" models
Publish Docker image #14072: Pull request #8087 synchronize by iamlemec
June 24, 2024 16:54 3h 3m 17s iamlemec:attention-type
gfx1010 optimizations
Publish Docker image #14071: Pull request #8085 synchronize by daniandtheweb
June 24, 2024 16:49 2h 38m 40s daniandtheweb:gfx1010_optimizations
CUDA: use MMQ instead of cuBLAS by default (#8075)
Publish Docker image #14070: Commit a818f30 pushed by JohannesGaessler
June 24, 2024 15:43 3h 34m 15s master
nix: update flake.lock
Publish Docker image #14069: Pull request #8071 synchronize by philiptaron
June 24, 2024 15:26 2h 27m 25s update_flake_lock_action
gfx1010 optimizations
Publish Docker image #14068: Pull request #8085 synchronize by daniandtheweb
June 24, 2024 14:47 2h 2m 18s daniandtheweb:gfx1010_optimizations
llama : reorganize source code + improve CMake
Publish Docker image #14067: Pull request #8006 synchronize by ggerganov
June 24, 2024 13:54 3s gg/reorganize-project
llama : reorganize source code + improve CMake
Publish Docker image #14066: Pull request #8006 synchronize by ggerganov
June 24, 2024 13:51 4s gg/reorganize-project
[SYCL] Re-enabled mul_mat_batched_sycl
Publish Docker image #14065: Pull request #8095 opened by airMeng
June 24, 2024 12:57 2h 48m 55s sycl-mul-mat-batched
CUDA: use MMQ instead of cuBLAS by default
Publish Docker image #14064: Pull request #8075 synchronize by JohannesGaessler
June 24, 2024 12:35 2h 28m 9s JohannesGaessler:cuda-mmq-default
gguf-py : fix tensor groups for encoder-decoder models in gguf-dump.p…
Publish Docker image #14063: Commit d62e4aa pushed by fairydreaming
June 24, 2024 12:13 2h 4m 26s master
llama : return nullptr from llama_grammar_init
Publish Docker image #14062: Pull request #8093 opened by danbev
June 24, 2024 11:25 1h 48m 21s danbev:grammar-init-return-null