Skip to content

Support BF16 kvcache, rope and attentions for inference of GGUF/GGML models #4370

Support BF16 kvcache, rope and attentions for inference of GGUF/GGML models

Support BF16 kvcache, rope and attentions for inference of GGUF/GGML models #4370

Annotations

4 warnings

Test Suite (macOS-latest, stable)

succeeded Dec 27, 2024 in 3m 21s