Skip to content

ARK v0.5.0

Latest
Compare
Choose a tag to compare
@chhwang chhwang released this 16 Dec 11:22
· 32 commits to main since this release
1762798
  • Integrate with MSCCL++
  • Removed dependency on gpudma
  • Add AMD CDNA3 architecture support
  • Support communication for AMD GPUs
  • Optimize OpGraph scheduling
  • Add a multi-GPU Llama2 example

See details from #168.