Release 0.40.0 · matatonic/openedai-vision

Version 0.40.0

new model support: AIDC-AI/Ovis1.6-Llama3.2-3B, AIDC-AI/Ovis1.6-Gemma2-27B
new model support: BAAI/Aquila-VL-2B-llava-qwen
new model support: HuggingFaceTB/SmolVLM-Instruct
new model support: google/paligemma2 family of models (very limited instruct/chat training so far)
Qwen2-VL: unpin Qwen2-VL-7B & remove Qwen hacks, GTPT-Int4/8 working again (still slow - why?)
pin bitsandbytes==0.44.1
⚠️ DEPRECATED MODELS (use the 0.39.2 docker image for support of these models): internlm-xcomposer2-7b, internlm-xcomposer2-7b-4bit, internlm-xcomposer2-vl-1_8b, internlm-xcomposer2-vl-7b, internlm-xcomposer2-vl-7b-4bit, nvidia/NVLM-D-72B, Llama-3-8B-Dragonfly-Med-v1, Llama-3-8B-Dragonfly-v1

Provide feedback