500AI

Search

Yuntao Bai

Constitutional AI: Harmlessness from AI Feedback
Many-shot Jailbreaking
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

All names