Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

用vllm部署Qwen2-VL-7B-Instruct模型,单次请求输入2张图片生成内容不完整,输入单张图片生成内容是完整的 #545

Open
xj260098061 opened this issue Nov 19, 2024 · 1 comment

Comments

@xj260098061
Copy link

上下文长度已经设为了最大值:max-model-len=32768

请求返回结果:
{'id': 'chat-68c5031ee03f4c2ca9cf00539fb388f8', 'object': 'chat.completion', 'created': 1732007500, 'model': 'Qwen2-VL-7B-Instruct', 'choices': [{'index': 0, 'message': {'role': 'assistant', 'content': '\n请', 'tool_calls': []}, 'logprobs': None, 'finish_reason': 'stop', 'stop_reason': None}], 'usage': {'prompt_tokens': 9565, 'total_tokens': 9571, 'completion_tokens': 6}, 'prompt_logprobs': None}

其中content=\n请,不知道为什么不继续生成了?

@Sijiyu
Copy link

Sijiyu commented Dec 11, 2024

差不多的问题,我是超过6张就不行

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants