Activity
Remove pytorch comments for outlines + compressed-tensors (vllm-proje…
[Bugfix] Fix LLaVA-NeXT feature size precision error (for real) (vllm…
Force push
[Doc] xpu backend requires running setvars.sh (vllm-project#6393)
[CI/Build] Add inputs tests (vllm-project#5215)
update multi request script
add yuan moe
[CI] Move CPU/AMD tests to after wait (vllm-project#4123)
[Bugfix] Fix Llava inference with Tensor Parallelism. (vllm-project#3883