Researchers

A curated starting point: people and work worth tracking in frontier AI.

Dario Amodei

Alignment, post-training, frontier LLMs

Anthropic, Frontier lab, Alignment, Post-training

Leads one of the most influential frontier-model labs; key voice on alignment and safe deployment.

Start here

Constitutional AI: Harmlessness from AI Feedback [Paper]

Jared Kaplan

Scaling laws, LLM training dynamics

Anthropic, Frontier lab, Scaling, LLMs

Core contributor to modern scaling-law intuition used across frontier training and evaluation.

Start here

Scaling Laws for Neural Language Models [Paper]

Jack Clark

AI policy, frontier-lab strategy, analysis

Anthropic, Industry, Policy

Bridges frontier labs and public understanding; consistently useful writing on what matters and why.

Start here

Jack Clark (website) [Link]

Amanda Askell

Alignment, behavior shaping, safety

Anthropic, Alignment, Post-training

Works on practical alignment and steering of frontier models; useful lens for post-training tradeoffs.

Start here

Constitutional AI: Harmlessness from AI Feedback [Paper]

Jan Leike

Alignment research, scalable oversight

Alignment, Frontier lab, RL

Prominent alignment researcher focused on making safety work scale with increasingly capable models.

Start here

Scalable agent alignment via reward modeling [Paper]

Chris Olah

Mechanistic interpretability, visualization

Interpretability, Frontier lab, Safety

Pushed interpretability forward with tools and approaches that shaped how people reason about neural nets.

Start here

Chris Olah (blog) [Blog]

Paul Christiano

Alignment theory, reward modeling

Alignment, RLHF, Safety

Major influence on reward-modeling and oversight ideas that feed into modern post-training.

Start here

Deep Reinforcement Learning from Human Preferences [Paper]

Ilya Sutskever

Deep learning, large-scale training

OpenAI, Frontier lab, LLMs

Central figure in the modern deep-learning wave; shaped large-scale training culture and capability focus.

Start here

Sequence to Sequence Learning with Neural Networks [Paper]

John Schulman

Reinforcement learning, post-training

OpenAI, RLHF, RL

Key contributor to practical RL algorithms and RLHF-era post-training used in modern assistants.

Start here

Proximal Policy Optimization Algorithms [Paper]

Training language models to follow instructions with human feedback [Paper]

Alec Radford

Generative pretraining, multimodal models

OpenAI, LLMs, Vision-language

Drove several foundational generative-pretraining efforts that set patterns for modern foundation models.

Start here

Language Models are Unsupervised Multitask Learners (GPT-2) [Report]

Learning Transferable Visual Models From Natural Language Supervision (CLIP) [Paper]

Tom Brown

Large-scale language modeling

OpenAI, LLMs, Scaling

Helped establish the modern era of large-scale language modeling and the evaluation mindset around it.

Start here

Language Models are Few-Shot Learners (GPT-3) [Paper]

Andrej Karpathy

Deep learning engineering, LLM education

Industry, Education, LLMs

Excellent at translating frontier ideas into practical intuition and tooling for builders.

Start here

Andrej Karpathy (website) [Link]

Sandhini Agarwal

Instruction tuning and RLHF

OpenAI, RLHF, Post-training

Worked on instruction-following and RLHF practices that became the standard post-training recipe.

Start here

Training language models to follow instructions with human feedback [Paper]

Pamela Mishkin

Instruction following, alignment

OpenAI, Post-training

Contributed to post-training workflows and datasets powering instruction-following behavior.

Start here

Training language models to follow instructions with human feedback [Paper]

Jeff Wu

Instruction following, post-training

OpenAI, Post-training

Worked on instruction-following models and post-training practice that influenced the ecosystem.

Start here

Training language models to follow instructions with human feedback [Paper]

Nicholas Carlini

Adversarial ML, security of deployed models

Security, Safety, Red teaming

High-signal work on real failure modes: adversarial examples, extraction, and practical model security.

Start here

Nicholas Carlini (site) [Link]

Demis Hassabis

Deep RL, scientific AI, leadership

DeepMind, Gemini, Frontier lab, RL

Built the organization behind many of the last decade’s most visible RL and scientific-AI breakthroughs.

Start here

Mastering the game of Go with deep neural networks and tree search (AlphaGo) [Paper]

David Silver

Deep RL, planning, games

DeepMind, Gemini, RL, Planning

Shaped modern deep RL in practice; a reliable anchor for understanding how learning and search combine.

Start here

Mastering the game of Go with deep neural networks and tree search (AlphaGo) [Paper]

Koray Kavukcuoglu

Large-scale training, systems

DeepMind, Gemini, Systems, Training

Leads work that makes large training runs possible and repeatable; a crucial but often underappreciated layer.

Start here

GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding [Paper]

Geoffrey Irving

Reasoning, verification, math

DeepMind, Gemini, Math reasoning

Strong signal in math/reasoning and verification-style approaches for reliability.

Start here

Geoffrey Irving (Google Scholar) [Profile]

Oriol Vinyals

Sequence models, large-scale ML

DeepMind, Gemini, LLMs

Key figure in sequence modeling and large-scale ML; contributes to frontier model development.

Start here

Sequence to Sequence Learning with Neural Networks [Paper]

Noam Shazeer

Transformers, Mixture-of-Experts, scaling

Gemini, Transformers, MoE, Scaling

One of the most important builders behind Transformers and MoE scaling that power modern LLMs.

Start here

Attention Is All You Need [Paper]

Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity [Paper]

Jeff Dean

ML systems, large-scale infrastructure

Google, Gemini, Systems, Scaling

One of the most influential figures in ML systems; shaped the infrastructure that makes frontier training feasible.

Start here

TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems [Project]

Pushmeet Kohli

Robotics, vision, structured prediction

DeepMind, Robotics, Perception

Bridges perception and action; a useful pick for following the convergence of robotics and foundation models.

Start here

Pushmeet Kohli (homepage) [Link]
