Skip to content

Activity

Remove pytorch comments for outlines + compressed-tensors (vllm-proje…

zhaoxudong01pushed 213 commits to main • 08fb75c…9a7c3a0 • 
29 days ago

[Bugfix] Fix LLaVA-NeXT feature size precision error (for real) (vllm…

zhaoxudong01pushed 2112 commits to main • 61e85db…08fb75c • 
on Jan 7

[Bugfix] Fix LLaVA-NeXT feature size precision error (for real) (vllm…

Force push
zhaoxudong01force pushed to yuan • 3d6cece…08fb75c • 
on Jan 7

[Doc] xpu backend requires running setvars.sh (vllm-project#6393)

zhaoxudong01pushed 441 commits to main • ec784b2…61e85db • 
on Jul 15, 2024

[CI/Build] Add inputs tests (vllm-project#5215)

zhaoxudong01pushed 355 commits to main • 11d652b…ec784b2 • 
on Jun 4, 2024

update multi request script

zhaoxudong01pushed 1 commit to yuan • 6e5ef80…3d6cece • 
on May 16, 2024

add yuan moe

zhaoxudong01created yuan • 6e5ef80 • 
on May 9, 2024

[CI] Move CPU/AMD tests to after wait (vllm-project#4123)

zhaoxudong01pushed 67 commits to main • 0ce0539…11d652b • 
on Apr 17, 2024

[Bugfix] Fix Llava inference with Tensor Parallelism. (vllm-project#3883

zhaoxudong01pushed 379 commits to main • 7a0b011…0ce0539 • 
on Apr 8, 2024