Commit
[FA2] tiling-qkv F32/F16 + swizzle q/qk/qkv🎉 (#213)
* Update flash_attn_mma_tiling_qkv.cu
* Update flash_attn_mma_tiling_qkv_F32F16F16F32.cu
* Update flash_attn_mma_tiling_qkv_swizzle_q.cu
* Update flash_attn.cc
* Update flash_attn_mma.py
* Update flash_attn_mma_tiling_qkv_swizzle_qk.cu
* Update flash_attn.cc
* Update flash_attn_mma.py
* Update flash_attn_mma_tiling_qkv_swizzle_qkv.cu
* Update flash_attn_mma_tiling_qkv_swizzle_q.cu
* Update flash_attn_mma_tiling_qkv_swizzle_qk.cu
* Update flash_attn_mma_tiling_qkv_swizzle_q.cu
* Update flash_attn_mma_tiling_qkv_swizzle_qkv.cu
* Update flash_attn.cc
* Update flash_attn_mma.py
* Update flash_attn_mma_share_kv.cu
* Update flash_attn_mma_share_kv_F32F16F16F32.cu
* Update flash_attn_mma_split_kv.cu
* Update flash_attn_mma_split_q.cu
* Update flash_attn_mma_tiling_qk.cu
* Update flash_attn_mma_tiling_qk_F32F16F16F32.cu
* Update flash_attn_mma_tiling_qkv.cu
* Update flash_attn_mma_tiling_qkv_F32F16F16F32.cu
* Create flash_attn_mma_tiling_qkv_swizzle_q_F32F16F16F32.cu
* Create flash_attn_mma_tiling_qkv_swizzle_qk_F32F16F16F32.cu
* Create flash_attn_mma_tiling_qkv_swizzle_qkv_F32F16F16F32.cu
* Update README.md
* Update README.md
* Update flash_attn_mma.py
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update flash_attn_mma_tiling_qkv_swizzle_q.cu
* Update flash_attn_mma_tiling_qkv_swizzle_q.cu
* Update flash_attn_mma_tiling_qkv_swizzle_qk.cu
* Update flash_attn_mma_tiling_qkv_swizzle_qk.cu
* Update flash_attn_mma_tiling_qkv_swizzle_q_F32F16F16F32.cu
* Update flash_attn_mma_tiling_qkv_swizzle_qk_F32F16F16F32.cu
* Update flash_attn.cc
* Update flash_attn_mma.py
* Update flash_attn_mma_tiling_qkv_F32F16F16F32.cu
* Update flash_attn_mma_tiling_qkv_swizzle_qk_F32F16F16F32.cu
* Update flash_attn_mma_tiling_qkv_swizzle_qkv_F32F16F16F32.cu
* Update flash_attn_mma.py
* Update flash_attn.cc
* Update README.md
* Update README.md