Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The attention mask is not set and cannot be inferred from input because pad token is same as eos token. As a consequence, you may observe unexpected behavior. Please pass your input's attention_mask to obtain reliable results. #267

Open
yangyang1-0 opened this issue Oct 12, 2024 · 1 comment

Comments

@yangyang1-0
Copy link

使用 Qwen2.5-7B-Instruct FastApi 部署调用 文档部署 Qwen2.5-32B-Instruct 模型,在最后的服务启动后,尝试使用 curl 测试推理,但是发送curl 命令后,服务打印 The attention mask is not set and cannot be inferred from input because pad token is same as eos token. As a consequence, you may observe unexpected behavior. Please pass your input's attention_mask to obtain reliable results. 后边没响应了。
image

@weijia24K
Copy link

兄弟,解决了吗?我也卡在这儿

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants