Researcher Profile
Editor reviewed
Andrew M. Dai
Research scientist at Google Research
A good researcher to follow for the infrastructure side of frontier language models, especially mixture-of-experts scaling, instruction tuning, and the data systems that make very large models usable.
Organizations
Google Research
About This Page
This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.
Last reviewed
March 18, 2026
Official and External Links
Known For
The ideas, systems, and research directions that make this person worth knowing.
01 Mixture-of-experts language models
02 Instruction tuning and FLAN
03 Data and scaling work behind Gemini-era systems
04 Gemini (multimodal foundation models)
Start Here
Canonical papers, project pages, or repositories that anchor this profile.
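Gemini: A Family of Highly Capable Multimodal Models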
Signature Works
Additional papers, projects, or repositories that help flesh out the profile.
Supporting Sources
Additional links that help verify and flesh out this profile.
Related Researchers
People worth exploring next because they share topics, labs, or source material with this profile.
One of the central Google researchers to follow for the line from large-scale language modeling into instruction tuning, multilingual systems, and practical model scaling.
One of the more useful people to study for the Gemini era because his work spans both the text core of multimodal frontier models and the optimization tricks that make those systems cheaper and more stable to train.
A high-signal researcher for understanding the modern scaling playbook, especially around compute-optimal training, retrieval-augmented language models, and the text side of Gemini-era multimodal systems.
Important for understanding how multilingual NLP, translation, and multimodal reasoning meet inside production-scale frontier systems rather than staying separate research tracks.
Worth tracking for the data side of multimodal frontier models, where the quality and shape of training mixtures strongly determine what large systems can actually do.
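A high-signal reinforcement-learning researcher whose work sits on the path from AlphaGo-era planning systems to Gemini-era reasoning and post-training techniques.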