Skip to content

Support BF16 kvcache, rope and attentions for inference of GGUF/GGML … #4388

Support BF16 kvcache, rope and attentions for inference of GGUF/GGML …

Support BF16 kvcache, rope and attentions for inference of GGUF/GGML … #4388

Annotations

5 warnings

Test Suite (ubuntu-latest, stable)

succeeded Dec 30, 2024 in 5m 37s