
Deploying this fine-tuned model #11

Open
babla9 opened this issue Jul 6, 2024 · 3 comments
babla9 commented Jul 6, 2024

Thanks so much for your work on this!

How can I deploy this fine-tuned model (expose via API endpoint)? Can I use vLLM or a library like this: https://github.com/EricLBuehler/mistral.rs, which supports Phi3-Vision?

Thanks!

@2U1
Copy link

2U1 commented Jul 15, 2024

@babla9 vLLM and mistral.rs both support Phi-3-Vision. But merging the LoRA weights into the base model first makes it easier to use with either.

@kevintee

@2U1 What do you mean by merging the LoRA weights? LoRA for Phi-3-Vision is currently not supported in vLLM.


2U1 commented Jul 20, 2024

@kevintee When you fine-tune the model with this code, you get adapter weights (because it fine-tunes Phi-3 with LoRA). You need to merge them into the original model before using it with vLLM.
