Back to researchers

Jason Weston

Self-rewarding post-training

Co-authored Self-Rewarding Language Models: explores self-improvement via internal reward modeling.

Highlights

Post-trainingAlignmentPreferences
Focus: Self-rewarding post-training
Why it matters: Co-authored Self-Rewarding Language Models: explores self-improvement via internal reward modeling.

Research Areas

Post-trainingAlignmentPreferences
Jason Weston - AI Researcher Profile | 500AI