Skip to content

Latest commit

 

History

History
7 lines (6 loc) · 1.71 KB

efficient_training.md

File metadata and controls

7 lines (6 loc) · 1.71 KB

Efficient Training

Title & Authors Introduction Links
StarPublish
Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention
Xingtai Lv, Ning Ding, Kaiyan Zhang, Ermo Hua, Ganqu Cui, Bowen Zhou
image Github
Paper
Star
COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training
Haocheng Xi, Han Cai, Ligeng Zhu, Yao Lu, Kurt Keutzer, Jianfei Chen, Song Han
image Github
Paper
Star
BitPipe: Bidirectional Interleaved Pipeline Parallelism for Accelerating Large Models Training
Houming Wu, Ling Chen, Wenjie Yu
image Github
Paper