Home/Researchers/Daniel Y. Fu

Researcher Profile

Editor reviewed

Daniel Y. Fu

Fast, memory-efficient attention

Researcher on long-context systems and fast model kernels at Together AI

One of the more useful people to follow for the systems side of modern model building, especially where better kernels and sequence methods translate directly into frontier-model training and inference speed.

Organizations

Together AIStanford University

About This Page

This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.

Official And External Links

Known For

The ideas, systems, and research directions that make this person worth knowing.

01

FlashAttention

02

FlashFFTConv

03

Kernel work for modern training and inference stacks

04

Fast, memory-efficient attention

05

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

06

FlashAttention (GitHub)

Start Here

Canonical papers, project pages, or repositories that anchor this profile.

Signature Works

Additional papers, projects, or repositories that help flesh out the profile.

Related Researchers

People worth exploring next because they share topics, labs, or source material with this profile.

Shared canonical source

Atri Rudra

Fast, memory-efficient attention

4 sources

Worth following because he brings a real theory background into the model-systems layer, especially where structured linear algebra and sequence methods end up mattering for practical modern architectures.

Start HereAtri Rudra

Shared canonical source

Christopher Ré

Fast, memory-efficient attention

4 sources

Important because he sits at a productive seam between machine learning, data systems, and model infrastructure, with work that ranges from weak supervision to some of the most important efficiency breakthroughs in modern training stacks.

Shared canonical source

Tri Dao

Efficient sequence models + attention kernels

4 sources

One of the clearest researchers to follow for efficient sequence-model systems, especially the line of work that made frontier training and inference materially faster rather than merely cleaner on paper.

Start HereTri Dao