Back to researchers
Atri Rudra
Fast, memory-efficient attention
Co-authored FlashAttention: one of the most impactful attention-kernel optimizations.
Highlights
FlashAttentionEfficient attentionSystems
Focus: Fast, memory-efficient attention
Why it matters: Co-authored FlashAttention: one of the most impactful attention-kernel optimizations.
Research Areas
FlashAttentionEfficient attentionSystems