gguf-v0.10.0

@piDack tagged this on 23 Aug 07:27
llama : use F32 precision in GLM4 attention and no FA (#9130)
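For context, a minimal sketch (an illustration, not the actual patch in #9130) of what "use F32 precision in attention and no FA" typically means in ggml-based code: the K*Q score tensor is marked for F32 accumulation via ggml_mul_mat_set_prec, and the regular mul_mat + softmax path is used rather than the flash-attention (ggml_flash_attn_ext) path. Shapes and the toy program structure below are assumptions for demonstration only.

```c
// Illustrative sketch only -- not the actual change from #9130.
// Forces F32 accumulation for the K*Q attention scores in ggml and
// takes the plain mul_mat path (no flash attention).
#include "ggml.h"
#include <stdio.h>

int main(void) {
    struct ggml_init_params params = {
        /* .mem_size   = */ 16*1024*1024,
        /* .mem_buffer = */ NULL,
        /* .no_alloc   = */ false,
    };
    struct ggml_context * ctx = ggml_init(params);

    // toy shapes (assumption): head_dim = 64, n_kv = 8, n_tokens = 4
    struct ggml_tensor * k = ggml_new_tensor_2d(ctx, GGML_TYPE_F16, 64, 8);
    struct ggml_tensor * q = ggml_new_tensor_2d(ctx, GGML_TYPE_F32, 64, 4);

    // attention scores K^T * Q -- request F32 precision for the accumulation
    struct ggml_tensor * kq = ggml_mul_mat(ctx, k, q);
    ggml_mul_mat_set_prec(kq, GGML_PREC_F32);

    // result is [n_kv, n_tokens]; softmax and the V product would follow in a full graph
    printf("kq: %lld x %lld\n", (long long) kq->ne[0], (long long) kq->ne[1]);

    ggml_free(ctx);
    return 0;
}
```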