Qwen and InternLM are Chinese LLMs.
Their architectures are basically the same as Llama3, with just a few extra biases in the attention layer. If I want to support them in Keras, should I create a new class such as `QwenCausalModel`, or simply add a few more configuration options to the existing Llama3?
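For context, the second option could amount to a single constructor flag on the attention projections. Here is a minimal sketch in plain Keras; the layer and argument names (`QkvProjections`, `use_qkv_bias`) are hypothetical and not the actual keras_nlp internals:

```python
import keras

class QkvProjections(keras.layers.Layer):
    # Hypothetical sketch: the q/k/v bias flag is the only structural
    # difference here, so it could be a backbone config option rather
    # than a whole new model class. Names are illustrative, not the
    # real keras_nlp internals.
    def __init__(self, hidden_dim, use_qkv_bias=False, **kwargs):
        super().__init__(**kwargs)
        # Llama3-style checkpoints: use_qkv_bias=False.
        # Qwen-style checkpoints: use_qkv_bias=True.
        self.query = keras.layers.Dense(hidden_dim, use_bias=use_qkv_bias)
        self.key = keras.layers.Dense(hidden_dim, use_bias=use_qkv_bias)
        self.value = keras.layers.Dense(hidden_dim, use_bias=use_qkv_bias)

    def call(self, x):
        return self.query(x), self.key(x), self.value(x)
```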
Additionally, is it possible for me not to provide a Kaggle link, but instead to convert the weights directly from Hugging Face (HF)?
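For reference, recent keras_nlp releases can already load some architectures straight from a Hugging Face handle via the `hf://` preset prefix, converting the safetensors checkpoint on the fly. A minimal sketch, using Llama3 since a Qwen class does not exist yet; the model handle is illustrative:

```python
import keras_nlp

# Sketch assuming `hf://` preset support in a recent keras_nlp version:
# the HF safetensors weights are converted on load, so no Kaggle upload
# is required.
model = keras_nlp.models.Llama3CausalLM.from_preset(
    "hf://meta-llama/Meta-Llama-3-8B"
)
```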