Co-authored model-written evals: a practical technique for discovering and measuring LM behaviors.
Researcher Profile
Christopher Olah
Model-written evaluations for LM behavior
Co-author, Model-Written Evals
Labs
About This Page
This profile is meant to help you get oriented quickly: why this researcher matters, what to read first, and where to explore next.
Last updated: March 20, 2026
Official And External Links
Known For
The ideas, systems, and research directions that make this person worth knowing.
01 Model-written evaluations for LM behavior
02 Discovering Language Model Behaviors with Model-Written Evaluations
03 Anthropic
04 Evaluation
05 Safety
Start Here
Canonical papers, project pages, or repositories that anchor this profile.
Signature Works
Additional papers, projects, or repositories that help flesh out the profile.
Related Researchers
People worth exploring next because they share topics, labs, or source material with this profile.