Back to researchers
Yuandong Tian
Streaming + long-context stability (attention sinks)
Co-authored Attention Sinks: a practical trick for stable streaming and long-context attention.
Highlights
Long contextInferenceEfficiency
Focus: Streaming + long-context stability (attention sinks)
Why it matters: Co-authored Attention Sinks: a practical trick for stable streaming and long-context attention.
Research Areas
Long contextInferenceEfficiency