Optimize a Triton gated dot-product attention kernel.
Keywords: triton
Optimize a fused dual-linear Jensen-Shannon divergence Triton kernel.
Optimize a fused linear and cross-entropy Triton kernel.
Optimize a Triton flash-attention kernel with causal masking.
Optimize a Triton decoding-attention kernel for decoder-style attention shapes.
Optimize a Triton cross-entropy loss kernel against PyTorch GPU baselines.