Co-authored MMLU: a widely used benchmark for general LLM capability across many subjects.
Researcher Profile
Dawn Song
Broad capability evaluation (MMLU)
Researcher at UC Berkeley
Organizations
Topics
About This Page
This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.
Last updated: March 20, 2026
Best First Clicks
Official And External Links
Known For
The ideas, systems, and research directions that make this person worth knowing.
01 Broad capability evaluation (MMLU)
02 Measuring Massive Multitask Language Understanding
03 Evaluation
04 Benchmarks
Start Here
Canonical papers, project pages, or repositories that anchor this profile.
Signature Works
Additional papers, projects, or repositories that help flesh out the profile.
Supporting Sources
Additional links that help verify and flesh out this profile.
Related Researchers
People worth exploring next because they share topics, labs, or source material with this profile.