Jacob Hilton

WebGPT: Browser-assisted Question-Answering with Human Feedback
TruthfulQA: Measuring How Models Mimic Human Falsehoods
Training Verifiers to Solve Math Word Problems
Training language models to follow instructions with human feedback