persistent kernel version of the flash attention forward + FLOP calculation fix when seqlen_q != seqlen_k #330

Integration-Tests-AMD (self-hosted, gfx942)

succeeded Dec 19, 2024 in 1h 24m 21s