Alex Ray

Hindsight Experience Replay
Learning Dexterous In-Hand Manipulation
Evaluating Large Language Models Trained on Code
Beyond the Imitation Game: Quantifying and Extrapolating the Capabilities of Language Models
Training language models to follow instructions with human feedback