0.40.0

@matatonic matatonic released this 07 Dec 04:06
· 3 commits to main since this release

Version 0.40.0

  • new model support: AIDC-AI/Ovis1.6-Llama3.2-3B, AIDC-AI/Ovis1.6-Gemma2-27B
  • new model support: BAAI/Aquila-VL-2B-llava-qwen
  • new model support: HuggingFaceTB/SmolVLM-Instruct
  • new model support: google/paligemma2 family of models (very limited instruct/chat training so far)
  • Qwen2-VL: unpin Qwen2-VL-7B & remove Qwen hacks, GPTQ-Int4/8 working again (still slow - why?)
  • pin bitsandbytes==0.44.1
  • ⚠️ DEPRECATED MODELS (use the 0.39.2 docker image for support of these models): internlm-xcomposer2-7b, internlm-xcomposer2-7b-4bit, internlm-xcomposer2-vl-1_8b, internlm-xcomposer2-vl-7b, internlm-xcomposer2-vl-7b-4bit, nvidia/NVLM-D-72B, Llama-3-8B-Dragonfly-Med-v1, Llama-3-8B-Dragonfly-v1