[pull] main from mlc-ai:main #302

pull · 2024-11-16T05:03:15Z

See Commits and Changes for more details.

Can you help keep this open source service alive? 💖 Please sponsor : )

Typo engine.py

Relative Path causes this error. #2600 One can really only use an absolute path here

docs: Update model template link address

This PR migrates the implementation of the grammar engine to an external library XGrammar. The implementation of XGrammar is basically the same as our current version, but more modularized and robust. XGrammar will be continuously enhanced in the future. See https://github.com/mlc-ai/xgrammar for more details.

[docs] Updated path pointers in both docs files to point to the `conversation_template` directory instead of linking to the non-existent `conversation_template.py` file. Co-authored-by: 涧波 <[email protected]>

This PR bumps the 3rdparty/tokenizer-cpp. The SentencePiece tokenizer was disabled by default to reduce the binary size of the built library, while it causes some error when users expect to use the SentencePiece tokenizer. This PR enables it by default. And we will need to manually disable it if we need to reduce its binary size.

This PR fixes the function table initialization, so that when sliding window is enabled, we won't pick the FlashInfer attn kernel.

This PR bumps the 3rdparty tokenizers-cpp to update the HuggingFace tokenizers package dependency.

This PR adds the sentencepiece package as a Python installation requirement for tokenizer coverage when running `gen_config`.

Update deepseek_model.py

shyeonn and others added 13 commits October 31, 2024 10:34

[Fix] Typo in serve/engine.py (#3000)

b656b2d

Typo engine.py

Fix relativ path comment (#3008)

81976e0

Relative Path causes this error. #2600 One can really only use an absolute path here

Simplify obvious choices in gen_cmake_config.py (#3006)

2ec4e38

[Docs] Update model template link address (#3007)

8436a92

docs: Update model template link address

Auto updated submodule references

f7b794b

[Docs] Fix typo export command with compile. (#3021)

2f37a63

[docs] Updated conversation template doc (#3020)

b68f7b7

[docs] Updated path pointers in both docs files to point to the `conversation_template` directory instead of linking to the non-existent `conversation_template.py` file. Co-authored-by: 涧波 <[email protected]>

[Fix] Disable FlashInfer when sliding window is enabled (#3026)

a9baa1f

This PR fixes the function table initialization, so that when sliding window is enabled, we won't pick the FlashInfer attn kernel.

[3rdparty] Bump tokenizers-cpp for HuggingFace tokenizer update (#3029)

3474073

This PR bumps the 3rdparty tokenizers-cpp to update the HuggingFace tokenizers package dependency.

[Python] Add sentencepiece as installation requirement (#3030)

e462885

This PR adds the sentencepiece package as a Python installation requirement for tokenizer coverage when running `gen_config`.

[Model] Update default prefill chunk size of Deepseek (#3009)

967fb76

Update deepseek_model.py

pull bot added the ⤵️ pull label Nov 16, 2024

pull bot merged commit 967fb76 into kp-forks:main Nov 16, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[pull] main from mlc-ai:main #302

[pull] main from mlc-ai:main #302

pull bot commented Nov 16, 2024 •

edited

Loading

[pull] main from mlc-ai:main #302

[pull] main from mlc-ai:main #302

Conversation

pull bot commented Nov 16, 2024 • edited Loading

pull bot commented Nov 16, 2024 •

edited

Loading