Skip to content

CUDA: quantized KV support for FA vec #3613

CUDA: quantized KV support for FA vec

CUDA: quantized KV support for FA vec #3613

Triggered via pull request May 27, 2024 22:39
@JohannesGaesslerJohannesGaessler
synchronize #7527
Status Success
Total duration 14m 1s
Artifacts

server.yml

on: pull_request_target
Matrix: server
Fit to window
Zoom out
Zoom in