fuse fp8 quant in kv copying and add flashinfer decode mla operator in the attention module #648
