One of the more useful people to study for the Gemini era because his work spans both the text-core of multimodal frontier models and the optimization tricks that make those systems cheaper and more stable to train.
Researcher Profile
Editor reviewedAngeliki Lazaridou
Gemini (multimodal foundation models)
Research scientist at DeepMind
A high-signal researcher for grounded language and retrieval-heavy systems, especially if you want to understand how language models stay useful as the world changes around them.
Organizations
About This Page
This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.
Last reviewed
March 18, 2026
Official And External Links
Known For
The ideas, systems, and research directions that make this person worth knowing.
01
Grounded language and multimodal learning
02
Temporal generalization in language models
03
Retrieval and internet-augmented language modeling
04
Gemini (multimodal foundation models)
05
Gemini: A Family of Highly Capable Multimodal Models
06
Gemini
Start Here
Canonical papers, project pages, or repositories that anchor this profile.
Signature Works
Additional papers, projects, or repositories that help flesh out the profile.
Supporting Sources
Additional links that help verify and flesh out this profile.
Related Researchers
People worth exploring next because they share topics, labs, or source material with this profile.
A high-signal researcher for understanding the modern scaling playbook, especially around compute-optimal training, retrieval-augmented language models, and the text side of Gemini-era multimodal systems.
A strong person to study for multilingual systems and instruction tuning, especially where translation, speech, and large-model post-training intersect.
One of the clearest multimodal researchers to track if you want to understand how frontier labs turned vision-language work from narrow benchmarks into general-purpose model capability.
Important for understanding how multilingual NLP, translation, and multimodal reasoning meet inside production-scale frontier systems rather than staying separate research tracks.
A useful name for the speech side of Google’s frontier stack, especially if you want the lineage from voice search and speech recognition systems into Gemini’s audio capabilities.