persistent kernel version of the flash attention forward + FLOP calculation fix when seqlen_q != seqlen_k #330

Annotations

1 warning

pre-commit (code formatting)

succeeded Dec 19, 2024 in 49s