Skip to content

[v0.24.1] Handle [DONE] signal from TGI + remove logic for "non-TGI servers"

Compare
Choose a tag to compare
@Wauplin Wauplin released this 23 Jul 14:44
· 139 commits to main since this release
1dd68a9

This release fixes 2 things:

  • handle "[DONE]" message in chat stream (related to TGI update huggingface/text-generation-inference#2221)
  • remove the "non-TGI" logic in chat completion since all models support server-side rendering now that even transformers-backed models are TGI-server.

See #2410 for more details.

Full Changelog: v0.24.0...v0.24.1