Releases: sorasoras/llama.cpp
Releases · sorasoras/llama.cpp
b2400
Server: format error to json (#5961) * server: format error to json * server: do not crash on grammar error * fix api key test case * revert limit max n_predict * small fix * correct coding style * update completion.js * launch_slot_with_task * update docs * update_slots * update webui * update readme
b2380
perplexity : support using multiple sequences to allow larger batch s…