Tutel v0.1.2
What's New in v0.1.2:
- General-purpose top-k gating with
{'type': 'top', 'k': 2}
; - Add Megatron-ML Tensor Parallel as gating type;
- Add deepspeed-based & megatron-based helloworld example for fair comparison;
- Add torch.bfloat16 datatype support for single-GPU;
How to Setup:
python3 -m pip install --user https://github.com/microsoft/tutel/archive/refs/tags/v0.1.2.tar.gz
Contributors: @ghostplant, @EricWangCN, @foreveronehundred.