Alignment research, scalable oversight
One of the clearest public anchors for scalable oversight and alignment research in the frontier-model era.
Topic
People exploring planning, tool use, and reasoning-heavy model behavior for longer-horizon tasks.
Start with Jan Leike, Danny Hernandez, or Amelia Glaese for the clearest first pass through agents & reasoning as it shows up in practice.
This area overlaps heavily with Anthropic, Google DeepMind, and AI21 Labs, which are also the most common institution signals across profiles. Recurring starting points include A Generalist Agent and Constitutional AI: Harmlessness from AI Feedback.
Snapshot
Researchers: 110
Related labs: 7
Starting points: 8
Developed dossiers: 14
Papers, project pages, and repositories that recur across this part of the field.
A Generalist Agent
Linked by 18 profiles in this topic
Constitutional AI: Harmlessness from AI Feedback
Linked by 15 profiles in this topic
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Linked by 15 profiles in this topic
Question Decomposition Improves the Faithfulness of Model-Generated Reasoning
Linked by 9 profiles in this topic
Measuring Faithfulness in Chain-of-Thought Reasoning
Linked by 8 profiles in this topic
Jamba-1.5: Hybrid Transformer-Mamba Models at Scale
Linked by 7 profiles in this topic
Jamba: A Hybrid Transformer-Mamba Language Model
Linked by 7 profiles in this topic
ReAct: Synergizing Reasoning and Acting in Language Models
Linked by 7 profiles in this topic
A stronger first pass through agents & reasoning, ranked by profile depth, evidence, and editorial importance.
Alignment via AI feedback (Constitutional AI)
A strong person to follow for how Anthropic moved from assistant training into more explicit evaluation work around model behavior, red-teaming, and chain-of-thought faithfulness.
Gemini (multimodal foundation models)
A useful researcher to follow if you care about the bridge between safety evaluation, human data, and how frontier models are turned into practical tools and benchmarks.
Mixture-of-experts LLMs
A useful person to follow if you care about the bridge between embodied-agent research and modern open-weight language-model systems, rather than treating those worlds as separate.
Mixture-of-experts LLMs
A strong person to know for the Mistral line of open-weight models, especially if you care about the arc from compact performant base models into mixture-of-experts, multimodal systems, and reasoning models.
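The mixture-of-experts entries above refer to a common sparse-layer design. As a minimal sketch of the core routing idea, not any particular lab's implementation: a gate scores experts per input, only the top-k experts run, and their outputs are mixed by renormalized gate weights. The experts and gate logits here are toy placeholders.

```python
import math

def softmax(xs):
    # numerically stable softmax over a list of floats
    m = max(xs)
    e = [math.exp(x - m) for x in xs]
    s = sum(e)
    return [v / s for v in e]

# toy experts: each just scales its scalar input differently
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]

def moe_layer(x, gate_logits, k=2):
    # pick the k highest-scoring experts for this input
    top = sorted(range(len(gate_logits)),
                 key=lambda i: gate_logits[i], reverse=True)[:k]
    # renormalize the gate over only the selected experts
    weights = softmax([gate_logits[i] for i in top])
    # only the selected experts are evaluated -- the sparsity that saves compute
    return sum(w * experts[i](x) for w, i in zip(weights, top))

out = moe_layer(1.0, gate_logits=[0.1, 2.0, 2.0, 0.1], k=2)
print(out)  # experts 1 and 2 tie, so the result is the mean of 2.0 and 3.0
```

In real MoE LLM layers the experts are feed-forward networks and the gate is learned, but the select-then-mix structure is the same.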
Reasoning, verification, math
A useful person to study if you care about alignment proposals that try to make superhuman systems legible enough for humans to supervise in practice.
Sequence models, large-scale ML
A high-signal researcher for understanding how DeepMind approaches generality, especially in areas where reinforcement learning, multimodality, and large-scale systems meet.
Deep learning, research leadership
A long-running builder of ML intuition whose influence spans Bayesian methods, reinforcement learning, and recent work on generalist and generative environments.
Faster LLM inference via speculative decoding
An important systems profile: he is a named author on speculative decoding, a technique that became part of the mainstream conversation about making large-model inference materially faster without changing its outputs.
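To make the "faster without changing outputs" claim concrete, here is a minimal sketch of the greedy variant of speculative decoding, using hypothetical toy stand-ins (`draft_next`, `target_next`) for a small draft model and a large target model. With greedy acceptance, the final sequence is identical to running the target model alone; the draft only reduces the number of target calls when the two models agree.

```python
def draft_next(seq):
    # toy "small draft model": next token is last token + 1, mod 10
    return (seq[-1] + 1) % 10

def target_next(seq):
    # toy "large target model": agrees with the draft except after token 5
    return 0 if seq[-1] == 5 else (seq[-1] + 1) % 10

def speculative_decode(prompt, n_tokens, k=4):
    seq = list(prompt)
    while len(seq) < len(prompt) + n_tokens:
        # 1) draft model proposes up to k tokens autoregressively (cheap)
        proposal = []
        for _ in range(k):
            proposal.append(draft_next(seq + proposal))
        # 2) target model verifies the proposals left to right
        accepted = []
        for tok in proposal:
            t = target_next(seq + accepted)
            if t == tok:
                accepted.append(tok)   # draft and target agree: keep it
            else:
                accepted.append(t)     # disagree: take the target's token, stop
                break
        else:
            # every proposal accepted: target yields one extra token for free
            accepted.append(target_next(seq + accepted))
        seq.extend(accepted)
    return seq[len(prompt):][:n_tokens]

print(speculative_decode([3], 6))  # -> [4, 5, 0, 1, 2, 3]
```

Running the toy target model alone, greedily, from the same prompt produces exactly the same tokens; the full method generalizes this to sampling via a probabilistic accept/reject rule that preserves the target distribution.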
110 linked profiles.