One of the clearest anchors for understanding why scaling laws became such a central planning tool for frontier-model research and training strategy.
Researcher Profile
FeaturedAndrej Karpathy
Deep learning engineering, LLM education
Founder at Eureka Labs
Important not only for his direct research contributions, but for translating frontier deep-learning ideas into builder intuition that spreads across the industry.
Organizations
About This Page
This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.
Last reviewed
March 18, 2026
Official And External Links
Known For
The ideas, systems, and research directions that make this person worth knowing.
01
Generative pretraining and practical deep-learning intuition
02
Bridging frontier research and hands-on engineering
03
Clear explanations that shape how builders learn modern AI systems
04
Deep learning engineering, LLM education
05
Andrej Karpathy (website)
06
Industry
Start Here
Canonical papers, project pages, or repositories that anchor this profile.
Signature Works
Additional papers, projects, or repositories that help flesh out the profile.
Supporting Sources
Additional links that help verify and flesh out this profile.
Related Researchers
People worth exploring next because they share topics, labs, or source material with this profile.
Useful not just for his own technical work, but because he consistently translates frontier research, deployment shifts, and policy implications into a coherent field-level picture.
A defining figure of the deep-learning era whose influence comes from both landmark technical contributions and his role in setting the ambition level of frontier-model labs.
Important because several of the modern foundation-model playbooks trace back to work he helped drive, especially around generative pretraining and multimodal transfer.
One of the clearest researchers to study for the GPT-3 era, especially around few-shot learning, scaling behavior, and what larger language models started making possible in practice.
A high-signal researcher for understanding how DeepMind approaches generality, especially in areas where reinforcement learning, multimodality, and large-scale systems meet.