Researcher Profile
Joseph E. Gonzalez
Fast, cheap LLM serving (PagedAttention)
Co-author, vLLM
Co-authored vLLM: a widely used serving stack for efficient LLM inference.
Topics
vLLM · Serving · LLM Serving
About This Page
This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.
Last updated: March 20, 2026
Known For
The ideas, systems, and research directions that make this person worth knowing.
01 Fast, cheap LLM serving (PagedAttention)

Start Here
Canonical papers, project pages, or repositories that anchor this profile.
02 vLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention
03 vLLM (GitHub)
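To make "serving stack" concrete, here is a minimal sketch of offline batch inference with vLLM's Python API, following the project's published quickstart; the model name and sampling settings below are illustrative placeholders, not recommendations from this profile.

# Minimal vLLM offline-inference sketch (based on the project's quickstart).
# The model name and sampling values are illustrative placeholders.
from vllm import LLM, SamplingParams

prompts = [
    "The capital of France is",
    "PagedAttention improves LLM serving by",
]

# SamplingParams controls decoding; these values are arbitrary examples.
sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# LLM wraps the inference engine; internally it manages the KV cache in
# fixed-size paged blocks (the PagedAttention idea).
llm = LLM(model="facebook/opt-125m")

# generate() batches the prompts and returns one RequestOutput per prompt.
outputs = llm.generate(prompts, sampling_params)
for output in outputs:
    print(f"Prompt: {output.prompt!r}")
    print(f"Completion: {output.outputs[0].text!r}")

The paging scheme is what makes the "cheap" part possible: storing the KV cache in small blocks avoids the fragmentation of contiguous per-request allocations, so more concurrent requests fit in the same GPU memory.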
Related Researchers
People worth exploring next because they share topics, labs, or source material with this profile.