Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Build `vulkan_1_x` for general GPU. - Build `cuda 12.4.1` as default. Changed: - Renamed examples for TinyLlama (chat, tool calling) and OpenAI. - Updated demo models definitions. - Updated examples (chat, tool calling). - `get_special_tokens` not supports parameter `force_standard_special_tokens: bool=False` which bypasses tokenizer's special tokens with standard/common ones. - Build `cuda 12.5.1` as additional build target but packaged on PyPI. - Build `cuda 12.6` as additional build target but packaged on PyPI. - Build `openblas` as additional build target but packaged on PyPI. Fixed: - Handle `Options.no_display_prompt` on Python side.
- Loading branch information