
Support for smaller text encoders #3

Open
vladmandic opened this issue May 16, 2024 · 2 comments

Comments

@vladmandic
Contributor

Currently MuLan uses OpenGVLab/InternVL-14B-224px as its default text encoder.
While it's possible to pass a path to any downloadable encoder, which ones did you test?

Note that InternVL-14B-224px is a massive model, 27 GB on disk, and requires ~17 GB of VRAM to run in an FP16 context, which rules out using this library on any normal consumer GPU.
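As a back-of-the-envelope check of the figures above (a sketch, not from the MuLan codebase): FP16 weights cost roughly 2 bytes per parameter, before activations and framework overhead.

```python
def fp16_weight_gib(num_params: float) -> float:
    """Approximate FP16 weight memory in GiB: 2 bytes per parameter."""
    return num_params * 2 / 1024**3

# InternVL-14B has ~14 billion parameters
print(f"{fp16_weight_gib(14e9):.1f} GiB")  # ≈ 26 GiB for the weights alone
```

Activations, the KV/attention workspace, and CUDA overhead push the real footprint past the raw weight size, consistent with the 27 GB checkpoint quoted above.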

@Zeqiang-Lai
Contributor

Great suggestion! Making MuLan available to everyone is our ultimate goal, and we will experiment with smaller encoders.

@zengjie617789

zengjie617789 commented May 24, 2024

I used Mini-InternVL-Chat-2B-V1-5 as the text_encoder, but it loaded very slowly and I had to type "yes" at the trust_remote_code prompt.
What is the problem?

Update: it works now.
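A minimal sketch of one way to avoid that interactive prompt, assuming the standard Hugging Face `transformers` API (MuLan's own loader may accept these differently): passing `trust_remote_code=True` explicitly skips the "yes" confirmation that `transformers` shows when a checkpoint ships custom modeling code. The helper below just bundles the keyword arguments; the actual download is shown commented out.

```python
def encoder_load_kwargs(dtype: str = "float16") -> dict:
    """Keyword arguments for AutoModel.from_pretrained when loading a
    checkpoint with custom remote code, e.g. Mini-InternVL-Chat-2B-V1-5."""
    return {
        "trust_remote_code": True,   # skip the interactive confirmation prompt
        "torch_dtype": dtype,        # halve memory versus float32
        "low_cpu_mem_usage": True,   # stream weights instead of double-loading
    }

# Usage (downloads the checkpoint, so not run here):
# from transformers import AutoModel
# model = AutoModel.from_pretrained(
#     "OpenGVLab/Mini-InternVL-Chat-2B-V1-5", **encoder_load_kwargs())
```

Slow first loads are usually the one-time download plus checkpoint sharding; subsequent loads come from the local cache.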


3 participants