[v0.24.1] Handle [DONE] signal from TGI + remove logic for "non-TGI servers"
This release fixes 2 things:
- handle
"[DONE]"
message in chat stream (related to TGI update huggingface/text-generation-inference#2221) - remove the "non-TGI" logic in chat completion since all models support server-side rendering now that even transformers-backed models are TGI-server.
See #2410 for more details.
Full Changelog: v0.24.0...v0.24.1