A strong person to study for the modern NLP stack because his work spans denoising pretraining, retrieval-augmented generation, and later long-context inference tricks rather than only one phase of the language-model pipeline.
Researcher Profile
Editor reviewedGuangxuan Xiao
Streaming + long-context stability (attention sinks)
Researcher at MIT working on efficient long-context and multimodal inference
A strong systems page because his work repeatedly shows up where inference efficiency meets usable long context, especially in attention sinks, StreamingLLM, post-training quantization, and later long-context head designs.
Organizations
About This Page
This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.
Known For
The ideas, systems, and research directions that make this person worth knowing.
01
Attention sinks and StreamingLLM
02
Long-context inference efficiency
03
Quantization and efficient deployment
04
Streaming + long-context stability (attention sinks)
05
Efficient Streaming Language Models with Attention Sinks
06
Long context
Start Here
Canonical papers, project pages, or repositories that anchor this profile.
Signature Works
Additional papers, projects, or repositories that help flesh out the profile.
Supporting Sources
Additional links that help verify and flesh out this profile.
Related Researchers
People worth exploring next because they share topics, labs, or source material with this profile.
A high-signal researcher for the systems side of modern AI, especially where reinforcement learning, memory-efficient large-model training, and long-context inference meet.
A strong researcher to follow for efficient and long-context LLM systems, especially where inference tricks and memory management make large models practical to run.
One of the clearest researchers to follow for efficient AI systems, especially the line of work that makes large models smaller, faster, and easier to deploy without giving up too much quality.
A good person to follow if you care about what deployment-minded safety work looks like inside a frontier lab, especially around moderation, image systems, and system-card style evaluation.
A key person for understanding how foundation-model evaluation, governance, and research tooling became a coherent agenda rather than a scattered set of concerns.