bug: Fix, update & improve models in Jan Hub #46

imtuyethan · 2024-10-21T06:40:59Z

Problem

I have encountered many issues with the wrong model default settings (incorrect prompt template, the stop words missing, etc.).
e.g., comments in Jan 0.5.7 Release Sign Off janhq/jan#3818

Model Testing Results

I have tested 45 models from Jan Hub, here are the results.

Next step

Update correct default settings for failed models
Better description for all models
Consider removing legacy models
Update Hub with new trending models?

cc @hahuyhoang411

No.	Model Name	Issue Identified
1	Llama 3.2 1B Instruct Q8
2	Llama 3.2 3B Instruct Q8
3	Qwen2.5 7B Instruct Q4
4	Qwen2.5 Coder 7B Instruct Q4
5	Llama 3.1 8B Instruct Q4
6	Qwen2.5 14B Instruct Q4
7	Codestral 22B Q4	Error in response format, wrong prompt template?
8	TinyLlama Chat 1.1B Q4	Garbled response, error in response format
9	LlamaCorn 1.1B Q8
10	Deepseek Coder 1.3B Instruct Q8
11	Gemma 1.1 2B Q4	Error in response format, wrong prompt template?
12	Gemma 2 2B Q4
13	Phi-3 Mini Instruct Q4
14	Stable Zephyr 3B Q8
15	Llama 2 Chat 7B Q4	Error in response format, wrong stop word insertion?
16	CodeNinja 7B Q4	Error in response format, wrong prompt template?
17	LaVa 7B	Garbled response, sometimes cannot run
18	Mistral 7B Instruct Q4	Error in response format, wrong stop word insertion?
19	Noromaid 7B Q4
20	Openchat-3.5 7B Q4
21	Stealth 7B Q4
22	Trinity-v1.2 7B Q4
23	Vistral 7B Q4	Error in response format, wrong stop word insertion?
24	Qwen 2 7B Instruct Q4	Error in response format, wrong prompt template?
25	Qwen Chat 7B Q4
26	Llama 3 8B Instruct Q4
27	Hermes Pro Llama 3 8B Q4
28	Aya 23 8B Q4
29	Gemma 1.1 7B Q4	Error in response format, wrong stop word insertion?
30	BakLlava 1	Garbled response, sometimes cannot run, wrong stop word insertion?
31	Gemma 2 9B Q4
32	LaVa 13B Q4	Garbled response; prompt template issue?
33	Wizard Coder Python 13B Q4	Garbled response; prompt template issue?
34	Phi-3 Medium Instruct Q4
35	Gemma 2 27B Q4
36	Qwen2.5 32B Instruct Q4
37	Deepseek Coder 35B Instruct Q4
38	Phind 34B Q4	Error in response format, wrong stop word insertion?
39	Yi 34B Q4
40	Command-R v01 34B Q4	Garbled response; prompt template issue?
41	Aya 23 35B Q4
42	Mixtral 8x7B Instruct Q4	Error in response format, wrong stop word insertion?
43	Llama 3.1 70B Instruct Q4
44	Llama 2 Chat 70B Q4	Error in response format, wrong stop word insertion?
45	Qwen2.5 72B Instruct Q4

On one note

We will need to develop model.yaml to easily define model capabilities (e.g. function calling, vision, etc). Users are facing an issue with imported LlaVa: janhq/jan#3855

model.yaml should have some sort of capabilities field, e.g. tools: true
Jan allows users to "edit" Models, e.g. view a model's functionalities + edit it
Cortex: users will just edit model.yaml directly