Skip to content

0.21.0

Compare
Choose a tag to compare
@matatonic matatonic released this 05 Jun 02:15
· 107 commits to main since this release

Recent updates

Version 0.21.0

  • new model support: Salesforce/xgen-mm-phi3-mini-instruct-r-v1
  • Major improvements in quality and compatibility for --load-in-4/8bit for many models (InternVL-Chat-V1-5, cogvlm2, MiniCPM-Llama3-V-2_5, Bunny, Monkey, ...). Layer skip with quantized loading.