Skip to content

b3947

Compare
Choose a tag to compare
@github-actions github-actions released this 21 Oct 08:00
bc21975
speculative : fix handling of some input params (#9963)

* speculative : fix batch sizes at initialization

ggml-ci

* speculative : handle params.n_predict == -1

* speculative : limit batch size to llama_n_batch