Add qwen2-0.5B-instruct support. #35
Comments
I've explored this model, and I hope to add it at some point. It isn't currently supported in the ctranslate2 backend that we use for inference. If/when it is supported there, it shouldn't be difficult to add here.
Umm, small question. Now that llama-cpp supports flan-t5, would you consider changing from ctranslate2 to it? It would allow broader model and quantization support (making it easier to maintain, as you don't have to convert your own models).
I'm open to that. At the moment I'm not aware of well-maintained Python bindings that support batched inference for llama-cpp. I would prefer not to lose that performance benefit. There is work being done on this in llama-cpp-python.
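For context, batched inference means the backend processes several prompts in one forward pass instead of one call per prompt, which is the performance benefit mentioned above. A minimal sketch of the batching logic, with hypothetical names (`chunked`, `fake_generate` are illustrative stand-ins, not the actual ctranslate2 or llama-cpp-python API):

```python
# Illustrative sketch only: shows how prompts are grouped into batches
# so a backend can run one forward pass per batch. The backend call is
# faked here; real code would use e.g. a generator's batch API.

def chunked(items, batch_size):
    """Split a list of prompts into batches of at most batch_size."""
    for i in range(0, len(items), batch_size):
        yield items[i:i + batch_size]

def fake_generate(batch):
    """Stand-in for a backend call that handles a whole batch at once."""
    return [f"completion for: {p}" for p in batch]

prompts = ["hello", "how are you", "tell me a joke", "bye", "thanks"]
outputs = []
for batch in chunked(prompts, batch_size=2):
    outputs.extend(fake_generate(batch))
# outputs now holds one completion per prompt, produced in 3 backend
# calls instead of 5.
```

With per-prompt bindings the loop body would have to call the backend once per prompt, which is the overhead a batched API avoids.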
@lavilao There's still no Qwen2 (or 2.5) support, but I did recently update the package to support the following instruct models:
Awesome, I wonder if Llama 3.2 1B will run on my potato.
It's a really good model for its size, and it aligns with the goal of this project.