500AI

Peter Welinder

Training Language Models to Follow Instructions with Human Feedback
Solving Rubik’s Cube with a Robot Hand
Learning Dexterous In-Hand Manipulation