
Qwen-VL-7B model +vllm assert len(indices) == len(inputs) AssertionError #542

sssunXw opened this issue on Nov 18, 2024
sssunXw commented Nov 18, 2024

Using either the webui or the vLLM API results in an error. The startup command is as follows:

```bash
CUDA_VISIBLE_DEVICES=0,1,2,3 API_PORT=8000 llamafactory-cli api \
  --model_name_or_path /mnt/workspace/models/qwen2_vl_7b \
  --template qwen2_vl \
  --infer_backend vllm \
  --vllm_enforce_eager
```

or `llamafactory-cli webui`
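
For reference, a minimal sanity check against the server (a sketch, assuming the LLaMA-Factory API exposes the standard OpenAI-compatible `/v1/models` route and accepts any placeholder API key) can confirm which model id the server expects in the `"model"` field of requests:

```python
# Sketch: list the model id(s) served by the OpenAI-compatible endpoint
# started above. Assumes the `openai` Python package (>=1.0) is installed
# and that any placeholder API key is accepted.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

# The id printed here is the value to use for "model" in chat requests.
for model in client.models.list():
    print(model.id)
```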

```bash
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "Qwen2-VL-7B-Instruct",
    "messages": [
      {"role": "system", "content": "你是资深分析师."},
      {"role": "user", "content": [
        {"type": "image_url", "image_url": {"url": "xxx"}},
        {"type": "text", "text": "描述用户文化程度水平分布"}
      ]}
    ]
  }'
```

(The system prompt means "You are a senior analyst." and the user text asks to "describe the distribution of users' education levels"; the image URL is a placeholder.)
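
The same request can also be sent from Python; below is a minimal sketch using the `openai` client, where the local image file `example.png` and the base64 data URL are hypothetical placeholders for whatever image triggers the error:

```python
# Sketch: send the same multimodal chat request through the OpenAI-compatible
# API. The image file below is a hypothetical placeholder.
import base64

from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

# Encode a local image as a data URL so no external image host is needed.
with open("example.png", "rb") as f:  # hypothetical image path
    image_b64 = base64.b64encode(f.read()).decode("utf-8")

response = client.chat.completions.create(
    model="Qwen2-VL-7B-Instruct",  # must match the model id served by the API
    messages=[
        {"role": "system", "content": "你是资深分析师."},
        {
            "role": "user",
            "content": [
                {
                    "type": "image_url",
                    "image_url": {"url": f"data:image/png;base64,{image_b64}"},
                },
                {"type": "text", "text": "描述用户文化程度水平分布"},
            ],
        },
    ],
)
print(response.choices[0].message.content)
```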

The error is as follows:
[Screenshot of the traceback, ending in `assert len(indices) == len(inputs)` followed by `AssertionError`]

sssunXw changed the title from "Qwen-VL-7B model + vllm image-content inference error" to "Qwen-VL-7B model +vllm assert len(indices) == len(inputs) AssertionError" on Nov 18, 2024