Skip to content

Actions: ggerganov/llama.cpp

Publish Docker image

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
7,841 workflow run results
7,841 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Fix tensor groups for encoder-decoder models in gguf-dump.py
Publish Docker image #14061: Pull request #8090 synchronize by mofosyne
June 24, 2024 10:58 1h 12m 57s fairydreaming:gguf-dump-grouping-fix
June 24, 2024 10:58 1h 12m 57s
CUDA: optimize MMQ int8 tensor core performance (#8062)
Publish Docker image #14059: Commit 9a590c8 pushed by JohannesGaessler
June 24, 2024 10:41 1h 48m 52s master
June 24, 2024 10:41 1h 48m 52s
CUDA: optimize MMQ int8 tensor core performance
Publish Docker image #14058: Pull request #8062 synchronize by JohannesGaessler
June 24, 2024 09:44 1h 50m 37s JohannesGaessler:cuda-mmq-2xa-3
June 24, 2024 09:44 1h 50m 37s
Option to split during conversion (#6942)
Publish Docker image #14057: Commit 52fc870 pushed by mofosyne
June 24, 2024 09:42 2h 15m 27s master
June 24, 2024 09:42 2h 15m 27s
Add chat template support for llama-cli
Publish Docker image #14056: Pull request #8068 synchronize by ngxson
June 24, 2024 09:07 1h 36m 33s ngxson:xsn/main_chat_template_2
June 24, 2024 09:07 1h 36m 33s
Add chat template support for llama-cli
Publish Docker image #14055: Pull request #8068 synchronize by ngxson
June 24, 2024 09:00 7m 12s ngxson:xsn/main_chat_template_2
June 24, 2024 09:00 7m 12s
Add chat template support for llama-cli
Publish Docker image #14054: Pull request #8068 synchronize by ngxson
June 24, 2024 08:57 2m 27s ngxson:xsn/main_chat_template_2
June 24, 2024 08:57 2m 27s
Add chat template support for llama-cli
Publish Docker image #14053: Pull request #8068 synchronize by ngxson
June 24, 2024 08:57 1m 0s ngxson:xsn/main_chat_template_2
June 24, 2024 08:57 1m 0s
Add chat template support for llama-cli
Publish Docker image #14052: Pull request #8068 synchronize by ngxson
June 24, 2024 08:52 4m 45s ngxson:xsn/main_chat_template_2
June 24, 2024 08:52 4m 45s
llama : reorganize source code + improve CMake
Publish Docker image #14049: Pull request #8006 synchronize by ggerganov
June 24, 2024 08:21 3s gg/reorganize-project
June 24, 2024 08:21 3s
Add Unigram tokenizer needed by T5 and FLAN-T5 model families
Publish Docker image #14048: Pull request #8089 opened by fairydreaming
June 24, 2024 07:23 1h 59m 11s fairydreaming:t5-clean-2
June 24, 2024 07:23 1h 59m 11s
llama : reorganize source code + improve CMake
Publish Docker image #14047: Pull request #8006 synchronize by ggerganov
June 24, 2024 07:12 3s gg/reorganize-project
June 24, 2024 07:12 3s
llama : reorganize source code + improve CMake
Publish Docker image #14046: Pull request #8006 synchronize by ggerganov
June 24, 2024 07:07 3s gg/reorganize-project
June 24, 2024 07:07 3s
llama : reorganize source code + improve CMake
Publish Docker image #14045: Pull request #8006 synchronize by ggerganov
June 24, 2024 06:55 4s gg/reorganize-project
June 24, 2024 06:55 4s
CUDA: use MMQ instead of cuBLAS by default
Publish Docker image #14044: Pull request #8075 synchronize by JohannesGaessler
June 24, 2024 06:51 2h 4m 30s JohannesGaessler:cuda-mmq-default
June 24, 2024 06:51 2h 4m 30s
CUDA: optimize MMQ int8 tensor core performance
Publish Docker image #14043: Pull request #8062 synchronize by JohannesGaessler
June 24, 2024 06:48 1h 18m 14s JohannesGaessler:cuda-mmq-2xa-3
June 24, 2024 06:48 1h 18m 14s
Streamline embeddings from "non-embedding" models
Publish Docker image #14042: Pull request #8087 synchronize by iamlemec
June 24, 2024 06:42 58m 45s iamlemec:attention-type
June 24, 2024 06:42 58m 45s
Option to split during conversion
Publish Docker image #14041: Pull request #6942 synchronize by christianazinn
June 24, 2024 06:14 1h 12m 30s christianazinn:convert-split
June 24, 2024 06:14 1h 12m 30s
Streamline embeddings from "non-embedding" models
Publish Docker image #14040: Pull request #8087 synchronize by iamlemec
June 24, 2024 06:04 38m 8s iamlemec:attention-type
June 24, 2024 06:04 38m 8s
Streamline embeddings from "non-embedding" models
Publish Docker image #14039: Pull request #8087 opened by iamlemec
June 24, 2024 06:00 4m 3s iamlemec:attention-type
June 24, 2024 06:00 4m 3s
disable publishing the full-rocm docker image (#8083)
Publish Docker image #14038: Commit 8cb508d pushed by ggerganov
June 24, 2024 05:36 1h 54m 7s master
June 24, 2024 05:36 1h 54m 7s
embedding : more cli arguments (#7458)
Publish Docker image #14037: Commit 646ef4a pushed by ggerganov
June 24, 2024 05:30 37m 14s master
June 24, 2024 05:30 37m 14s