Skip to content

v2.6.0

Compare
Choose a tag to compare
@tastelikefeet tastelikefeet released this 13 Nov 08:06
· 19 commits to main since this release

English Version

Models

  1. Support Qwen2.5 coder models

Feature

  1. Correct and support the new loss and gradient accumulation algorithm from transformers.trainer

中文版本

模型

  1. 支持千问coder系列模型

功能

  1. 支持新的transformers loss和GA计算算法,并修正了其中的bug

What's Changed

Full Changelog: v2.5.2...v2.6.0