500AI

Search

Neel Nanda

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

All names