Back to topics

Topic

Reinforcement Learning

Researchers working on decision-making, planning, self-play, and RL methods that still shape modern AI systems.

Start with Demis Hassabis, Chris Olah, Dario Amodei if you want the clearest first pass through reinforcement learning as it shows up in practice.

This area overlaps heavily with Anthropic, Google DeepMind, AI21. Common institution signals include Anthropic, Google DeepMind, Google. Recurring starting points include Constitutional AI: Harmlessness from AI Feedback, Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback.

Snapshot

Researchers

90

Related labs

6

Starting points

8

Developed dossiers

27

Institution Signals

Frequent institutions showing up across profiles in this area.

Anthropic (48)Google DeepMind (12)Google (7)Meta (2)AISLE (1)Alignment Research Center (1)Center for AI Policy (1)Istituto Nazionale di Fisica Nucleare, Sezione di Pisa (1)

Canonical Starting Points

Papers, project pages, and repositories that recur across this part of the field.

Frequently Linked Sources

Source clusters that repeatedly anchor researchers in this area.

Researchers To Start With

A stronger first pass through reinforcement learning, ranked by profile depth, evidence, and editorial importance.

All Researchers In This Topic

90 linked profiles.