v2.6.0
English Version
Models
- Support Qwen2.5-Coder models
Feature
- Support the new loss and gradient accumulation (GA) algorithm from transformers.trainer, and fix a bug in it
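
For context, a minimal sketch of why the loss calculation matters under gradient accumulation. It assumes the change concerns normalizing by the total token count across micro-batches rather than averaging per-micro-batch means; the function names and the toy loss values are hypothetical and are not taken from swift or transformers.

```python
# Illustrative only: plain-Python stand-ins for per-token losses.
# Each inner list holds the per-token losses of one micro-batch.

def mean_of_means(micro_batches):
    """Average each micro-batch first, then average those averages."""
    per_batch = [sum(batch) / len(batch) for batch in micro_batches]
    return sum(per_batch) / len(per_batch)


def token_normalized(micro_batches):
    """Sum all token losses and divide by the total token count."""
    num_items = sum(len(batch) for batch in micro_batches)
    total_loss = sum(sum(batch) for batch in micro_batches)
    return total_loss / num_items


# Two micro-batches with different token counts (4 tokens vs. 1 token).
micro_batches = [[2.0, 2.0, 2.0, 2.0], [8.0]]

naive = mean_of_means(micro_batches)     # the short batch is over-weighted
fixed = token_normalized(micro_batches)  # matches one large batch of 5 tokens

print(naive, fixed)
```

The two results differ whenever micro-batches contain different numbers of tokens; only the token-normalized value equals the loss of the same data processed as a single batch.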
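Chinese Version
Models
- Support the Qwen coder series models
Features
- Support the new transformers loss and gradient accumulation (GA) calculation algorithm, and fix a bug in it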
What's Changed
- fix gptq ovis quant (inputs_embeds) by @Jintao-Huang in #2378
- [TorchAcc] fix qwen2 for transformers>=4.45 by @baoleai in #2379
- fix trl transformers==4.46 compat by @Jintao-Huang in #2385
- fix deploy stream media_type by @Jintao-Huang in #2393
- fix_mplug_owl3_cut_shape by @Jintao-Huang in #2394
- fix swift deploy (lmdeploy stream) by @Jintao-Huang in #2397
- fix lmdeploy warning & gptq-int4 support by @Jintao-Huang in #2401
- support qwen2.5-coder by @Jintao-Huang in #2400
- fix qwen_vl npu by @Jintao-Huang in #2408
- fix model path by @yingdachen in #2410
- fix qwen_vl dpo by @Jintao-Huang in #2411
- Compat transformers 4.46.2 loss by @Jintao-Huang in #2413
- Fix molmo infer by @Jintao-Huang in #2419
- Fix dataset map by @Jintao-Huang in #2421
- Support qwen2 5 coder series model by @Jintao-Huang in #2422
- fix PPO by @hjh0119 in #2377
- fix docs by @Jintao-Huang in #2425
- compat transformers==4.47 by @Jintao-Huang in #2426
- fix internvl2 lmdeploy>=0.6 deploy by @Jintao-Huang in #2429
Full Changelog: v2.5.2...v2.6.0