Issue: Update vLLM to version 0.5.0+, and a few suggestions #83
Comments
+1! I really would like to run Phi3VForCausalLM
+1!
+1, Gemma 2 support has recently been rolled out in vLLM!
+1, it would make much more sense to `pip install vllm`, so that when a new model is released and implemented in vLLM it is automatically integrated into this worker. @alpayariyak
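To illustrate the point of that suggestion: with vLLM installed straight from PyPI, any model the installed release supports can be served without changing the worker itself. A minimal sketch (the model name and version are just examples, not anything this worker currently ships):

```python
# Minimal sketch: once the worker installs vLLM from PyPI, upgrading the
# package is enough to pick up newly supported architectures.
from vllm import LLM, SamplingParams

# Example model; Gemma 2 requires a recent vLLM release (>= 0.5.1).
llm = LLM(model="google/gemma-2-9b-it")

outputs = llm.generate(["Hello!"], SamplingParams(max_tokens=64))
print(outputs[0].outputs[0].text)
```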
Are there any plans to upgrade the vLLM version, and if so, can you provide a date?
+1, then we could finally run DeepSeek-Coder v2
+1, Llama 3.1 needs 0.5.3: https://github.com/vllm-project/vllm/releases/tag/v0.5.3. Can we upgrade this worker to support it out of the box in RunPod serverless vLLM?
Also waiting for the update :) Let me know if I can help!
Hi all, thank you so much for the suggestions! I've joined a different company, so @pandyamarut will be taking over. It's been a great pleasure serving you all!
I wish you an amazing next work experience ;) Welcome aboard, @pandyamarut!
Working on it, sorry for the delay. Thanks for maintaining the repo, @alpayariyak
Do we know anything about the approximate time frame for the update, so we can plan model updates on our roadmaps? Thanks
I've got a whole new menu with a bunch of new options; I guess it's all of the arguments. That's great, thank you for the update, maintainers! Just the option values still need to be updated :)
Description
The tensorize feature is like giving vLLM a turbo boost. Check out the Tensorize vLLM example for a sneak peek. Kudos to the stellar maintainer!
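For reference, a minimal sketch of what loading a tensorized model looks like with vLLM's tensorizer integration (the model name and S3 URI are placeholders, and the exact import path and config fields may differ between vLLM versions; see the tensorize_vllm_model example script in the vLLM repository):

```python
from vllm import LLM
from vllm.model_executor.model_loader.tensorizer import TensorizerConfig

# Placeholder URI: point this at tensors produced by the
# tensorize_vllm_model example script from the vLLM repo.
config = TensorizerConfig(tensorizer_uri="s3://my-bucket/opt-125m.tensors")

# Loading pre-serialized tensors skips the usual weight deserialization,
# which is where the load-time "turbo boost" comes from.
llm = LLM(
    model="facebook/opt-125m",
    load_format="tensorizer",
    model_loader_extra_config=config,
)
```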