[Feature] Support Efficient Sparse HiP Attention (InfiniteHiP) with Long-Context Generalization and KV Offloading Capabilties #1787
This workflow is awaiting approval from a maintainer in #3930
This workflow is awaiting approval from a maintainer in #3930
pr-test-amd.yml
on: pull_request
accuracy-test-1-gpu-amd
mla-test-1-gpu-amd
finish