One of the better people to track for the sequence from Gopher to retrieval-augmented language models and then into Gemini, especially if you care about how DeepMind actually iterated on the frontier-model recipe.
Researcher Profile
Editor reviewedJordan Hoffmann
Compute-optimal scaling for LLM training
Researcher behind DeepMind’s retrieval and compute-optimal language-model work
One of the clearest people to follow for the sequence from retrieval-augmented language models to compute-optimal scaling and then into Gemini.
Organizations
Labs
About This Page
This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.
Known For
The ideas, systems, and research directions that make this person worth knowing.
01
Chinchilla and compute-optimal scaling
02
RETRO and retrieval-augmented language models
03
Gemini
04
Compute-optimal scaling for LLM training
05
Training Compute-Optimal Large Language Models
06
DeepMind
Start Here
Canonical papers, project pages, or repositories that anchor this profile.
Supporting Sources
Additional links that help verify and flesh out this profile.
Related Researchers
People worth exploring next because they share topics, labs, or source material with this profile.
A useful page for the DeepMind work that connected large-language-model scaling to the multimodal Gemini push, with a clearer safety-and-evaluation flavor than many purely scaling-focused pages.
Worth tracking for the DeepMind thread that links large-model scaling research to the multimodal Gemini stack, rather than treating those as separate eras.
A useful profile for the core DeepMind contributor layer behind Chinchilla, Gopher, and Gemini rather than only the more public faces of those systems.
A useful profile for the DeepMind researchers who helped carry the lab’s language-model program from scaling-law work into Gemini rather than appearing only on the final product layer.
A useful profile for the DeepMind researchers who sat inside the core language-model program as it moved from scaling-law analysis into the Gemini family.