-
-
Notifications
You must be signed in to change notification settings - Fork 5.7k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Bugfix] Handle content type with optional parameters
frontend
#13383
opened Feb 17, 2025 by
zifeitong
Loading…
[VLM] Check required fields before initializing field config in Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
DictEmbeddingItems
documentation
#13380
opened Feb 17, 2025 by
DarkLight1337
Loading…
Integrate the new ragged paged attention kernel with vLLM v1 on TPU
ci/build
v1
#13379
opened Feb 17, 2025 by
vanbasten23
•
Draft
[MISC] tiny fixes
ready
ONLY add when PR is ready to merge/full CI is needed
#13378
opened Feb 17, 2025 by
MengqingCao
Loading…
[V1][Spec Decode] Optimize N-gram matching with Numba
ci/build
v1
#13365
opened Feb 17, 2025 by
WoosukKwon
•
Draft
[Misc] Avoid calling unnecessary
hf_list_repo_files
for local model path
#13348
opened Feb 16, 2025 by
Isotr0py
Loading…
Use 88 as the line length to be compatible with Black
#13347
opened Feb 16, 2025 by
houseroad
Loading…
[V1] Get input tokens from scheduler
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#13339
opened Feb 15, 2025 by
WoosukKwon
Loading…
[Bugfix]: DeepseekR1 model load fails with weights tied error
#13335
opened Feb 15, 2025 by
cennn
Loading…
[Model] Add support for GraniteMoeShared models
#13313
opened Feb 15, 2025 by
tjohnson31415
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.