Co-authored MMLU: a widely used benchmark for general LLM capability across many subjects.
Researcher Profile
Jacob Steinhardt
Broad capability evaluation (MMLU)
Co-author, MMLU
Topics
About This Page
This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.
Last updated: March 20, 2026
Known For
The ideas, systems, and research directions that make this person worth knowing.
01 Broad capability evaluation (MMLU)
02 Measuring Massive Multitask Language Understanding
03 Evaluation
04 Benchmarks
Start Here
Canonical papers, project pages, or repositories that anchor this profile.
Related Researchers
People worth exploring next because they share topics, labs, or source material with this profile.