Core Velocity Lab

Attention

FlashAttention-2 in Vulkan with Tensor Core support

Gradient of the attention op