Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q1 2025
#11862 opened Jan 8, 2025 by simon-mo
Open
vLLM's V1 Engine Architecture
#8779 opened Sep 24, 2024 by simon-mo
Open 10

Issues list

[Bug]: Fail to use beamsearch with llm.chat (label: bug)
#12183 opened Jan 18, 2025 by gystar
1 task done
[Bug]: Multi-Node Online Inference on TPUs Failing (label: bug)
#12179 opened Jan 17, 2025 by BabyChouSr
1 task done
[Bug]: Slow huggingface weights download. Sequential download (label: bug)
#12177 opened Jan 17, 2025 by NikolaBorisov
1 task done
[New Model]: openbmb/MiniCPM-o-2_6 (label: new model)
#12162 opened Jan 17, 2025 by myoss
1 task done
[Usage]: Terminates without any error 30 seconds after a successful run. (label: usage)
#12160 opened Jan 17, 2025 by hznnnnnn
1 task done
[New Model]: jinaai/jina-embeddings-v3 (label: new model)
#12154 opened Jan 17, 2025 by TC10127
1 task done
[Performance]: Very low generation throughput on CPU (label: performance)
#12153 opened Jan 17, 2025 by SLIBM
1 task done
[Usage]: vllm context length handling method (label: usage)
#12146 opened Jan 17, 2025 by whoo9112
1 task done
[Bug]: High and unstable CPU usage when deployed on GPU (label: bug)
#12142 opened Jan 17, 2025 by yh-yao
1 task done
[New Model]: NV-Embed-v2 (label: new model)
#12137 opened Jan 17, 2025 by Hypothesis-Z
1 task done
[Bug]: Phi-3-small-8k cannot be served for vllm >= 0.6.5 (label: bug)
#12124 opened Jan 16, 2025 by JGSweets
1 task done
[Bug]: XGrammar-based CFG decoding degraded after 0.6.5 (labels: bug, structured-output)
#12122 opened Jan 16, 2025 by AlbertoCastelo
1 task done