Skip to content

Support BF16 kvcache, rope and attentions for inference of GGUF/GGML … #4388

Support BF16 kvcache, rope and attentions for inference of GGUF/GGML …

Support BF16 kvcache, rope and attentions for inference of GGUF/GGML … #4388

Annotations

5 warnings

Docs

succeeded Dec 30, 2024 in 2m 58s