Researcher Profile
Editor reviewed
Andrew M. Dai
Research scientist at Google Research
A good researcher to follow for the infrastructure side of frontier language models, especially mixture-of-experts scaling, instruction tuning, and the data systems that make very large models usable.
Organizations
Google Research
About This Page
This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.
Last reviewed
March 18, 2026
Official and External Links
Known For
The ideas, systems, and research directions that make this person worth knowing.
01 Mixture-of-experts language models
02 Instruction tuning and FLAN
03 Data and scaling work behind Gemini-era systems
04 Gemini (multimodal foundation models)
Start Here
Canonical papers, project pages, or repositories that anchor this profile.
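Gemini: A Family of Highly Capable Multimodal Models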
Signature Works
Additional papers, projects, or repositories that help flesh out the profile.
Supporting Sources
Additional links that help verify and flesh out this profile.
Related Researchers
People worth exploring next because they share topics, labs, or source material with this profile.
One of the central Google researchers to follow for the line from large-scale language modeling into instruction tuning, multilingual systems, and practical model scaling.
One of the more useful people to study for the Gemini era because his work spans both the text core of multimodal frontier models and the optimization tricks that make those systems cheaper and more stable to train.
A high-signal researcher for understanding the modern scaling playbook, especially around compute-optimal training, retrieval-augmented language models, and the text side of Gemini-era multimodal systems.
Important for understanding how multilingual NLP, translation, and multimodal reasoning meet inside production-scale frontier systems rather than staying separate research tracks.
Worth tracking for the data side of multimodal frontier models, where the quality and shape of training mixtures strongly determine what large systems can actually do.
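A high-signal reinforcement-learning researcher whose work sits on the path from AlphaGo-era planning systems to Gemini-era reasoning and post-training techniques.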