
[pull] main from mlc-ai:main #315

Merged
merged 2 commits on Dec 19, 2024
Conversation


@pull pull bot commented Dec 19, 2024

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.1)

Can you help keep this open source service alive? 💖 Please sponsor : )

hrishi121 and others added 2 commits December 19, 2024 20:03
This PR adds support for Nemotron architecture, and is in reference
to #2901 [Request for Nemotron-Mini-4B-Instruct]

Based on my analysis of the Nemotron architecture in
the Hugging Face repository, it appears largely similar
to the Llama architecture, with the following key distinctions:

- The activation function used in the MLP is `relu2` (squared ReLU).
- The MLP includes `up_proj` and `down_proj`, but does not have
a `gate_proj` as seen in Llama.
- It uses `layernorm1p`, and the normalization layer incorporates a bias term.
- The architecture employs a `partial_rotary_factor`, which is similar
to the approach used in the Phi architecture.
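The distinctions above can be sketched in plain NumPy. This is a hedged illustration, not the PR's actual implementation: the function names (`relu2`, `layernorm1p`, `nemotron_mlp`) mirror the terms used in the list, while the shapes and the `up_proj`/`down_proj` weight layout are assumptions following Llama-style conventions.

```python
# Sketch of the Nemotron-style feed-forward block described above (assumed
# shapes and naming; illustrative only, not the mlc-ai implementation).
import numpy as np

def relu2(x):
    # Squared ReLU: relu(x) ** 2
    return np.square(np.maximum(x, 0.0))

def layernorm1p(x, weight, bias, eps=1e-5):
    # "1p" LayerNorm variant: scales by (1 + weight) rather than weight,
    # and the normalization layer carries a bias term (unlike Llama's RMSNorm).
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    normed = (x - mean) / np.sqrt(var + eps)
    return normed * (1.0 + weight) + bias

def nemotron_mlp(x, up_proj, down_proj):
    # No gate_proj: just up-projection -> relu2 -> down-projection.
    return relu2(x @ up_proj) @ down_proj

rng = np.random.default_rng(0)
hidden, inter = 8, 16
x = rng.standard_normal((2, hidden))
w, b = np.zeros(hidden), np.zeros(hidden)
h = layernorm1p(x, w, b)
y = nemotron_mlp(h, rng.standard_normal((hidden, inter)),
                 rng.standard_normal((inter, hidden)))
print(y.shape)  # (2, 8)
```

A `partial_rotary_factor` (as in Phi) would additionally mean rotary position embeddings are applied to only a leading fraction of each attention head's dimensions, with the remainder passed through unrotated.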
This PR adds tensor parallelism (TP) support for the GPTJ model and fixes a minor typo in the OLMo model.
@pull pull bot added the ⤵️ pull label Dec 19, 2024
@pull pull bot merged commit 1825fed into kp-forks:main Dec 19, 2024