Skip to content

Releases: sorasoras/llama.cpp

b2400

11 Mar 11:49
caa106d
Compare
Choose a tag to compare
Server: format error to json (#5961)

* server: format error to json

* server: do not crash on grammar error

* fix api key test case

* revert limit max n_predict

* small fix

* correct coding style

* update completion.js

* launch_slot_with_task

* update docs

* update_slots

* update webui

* update readme

b2380

09 Mar 20:07
d894f35
Compare
Choose a tag to compare
perplexity : support using multiple sequences to allow larger batch s…