Researchers — page 26

Co-authored “The Llama 3 Herd of Models”.

3021

Thomas Brovelli

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3022

Thomas Degry

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

3023

Thomas Georgiou

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3024

Thomas Hubert

Self-play RL with search (AlphaZero)

Reinforcement Learning Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

Co-authored AlphaZero: a canonical reference for self-play + search in RL.

3025

Thomas Icard

Holistic evaluation of language models (HELM)

Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

Evaluation & Benchmarks Holistic Evaluation of Language Models

3026

Thomas Jurdi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3027

Thomas Mesnard

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

3028

Thomas Portet

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

3029

Thomas Robinson

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3030

Thomas Scialom

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

3031

Thomas Unterthiner

Vision Transformers (ViT)

Co-authored ViT: a turning point for transformers in vision.

Vision & Robotics An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

3032

Thomas Wang

Mixture-of-experts LLMs

Mistral

Co-authored Mixtral of Experts: a key MoE reference in the open-weights frontier.

Open Models Systems & Infrastructure Mixtral of Experts

3033

Thomas Wolf

Open-source tooling for modern NLP (Transformers library)

Hugging Face

Co-authored the Hugging Face Transformers paper that helped standardize modern NLP workflows.

Open Models Transformers: State-of-the-Art Natural Language Processing

3034

Thore Graepel

Self-play RL with search (AlphaZero)

Reinforcement Learning Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

Co-authored AlphaZero: a canonical reference for self-play + search in RL.

3035

Tian Huey Teh

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3036

Tian LIN

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3037

Tian Pei

Open-model frontier reports (DeepSeek-V3)

DeepSeek

Co-authored the DeepSeek-V3 Technical Report.

Open Models DeepSeek-V3 Technical Report

3038

Tianhang Zhu

Open-weight LLMs (Qwen)

Qwen

Co-authored the Qwen Technical Report.

Open Models Qwen Technical Report

3039

Tianhao Li

Open-weight LLMs (Qwen2)

Qwen

Co-authored the Qwen2 Technical Report.

Open Models Qwen2 Technical Report

3040

Tianhao Zheng

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

3041

Tianhe Li

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3042

Tianhe Yu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3043

Tianjun Zhang

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Evaluation & Benchmarks Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

3044

Tianle Li

Human preference evaluation at scale (Chatbot Arena)

Co-authored Chatbot Arena: a high-impact human-preference evaluation platform for LLMs.

3045

Tianqi Liu

Open multimodal models (Gemma 3)

Open Models Multimodal Gemma 3 Technical Report

Co-authored the Gemma 3 Technical Report.

3046

Tianrun Li

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3047

Tianyi Zhang

Holistic evaluation of language models (HELM)

Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

Evaluation & Benchmarks Holistic Evaluation of Language Models

3048

Tianyu Liu

Open-weight LLMs (Qwen2)

Qwen

Co-authored the Qwen2 Technical Report.

Open Models Qwen2 Technical Report

3049

Tianyu Sun

Open-model frontier reports (DeepSeek-V3)

DeepSeek

Co-authored the DeepSeek-V3 Technical Report.

Open Models DeepSeek-V3 Technical Report

3050

Tim Brooks

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

Systems & Infrastructure About Me — Tim Dettmers

3051

Tim Dettmers

Efficient finetuning of quantized LLMs

A core person to know for making serious language-model finetuning and inference feasible on smaller hardware, especially through quantization and optimizer tooling that working builders actually use.

3052

Tim Green

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3053

Tim Matthews

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Open Models Transformers: State-of-the-Art Natural Language Processing

3054

Tim Rault

Open-source tooling for modern NLP (Transformers library)

Hugging Face

Co-authored the Hugging Face Transformers paper that helped standardize modern NLP workflows.

3055

Tim Rocktäschel

Retrieval-augmented generation (RAG)

Co-authored RAG: a canonical reference for retrieval-augmented generation in NLP.

Evaluation & Benchmarks Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

3056

Tim Salimans

Text-to-image diffusion with strong language understanding (Imagen)

Diffusion & Generative Media Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Co-authored Imagen: a milestone for photorealistic text-to-image diffusion models.

3057

Timo Schick

Teaching LMs to use tools (Toolformer)

Co-authored Toolformer: an influential approach to tool use via self-supervision.

Agents & Reasoning Toolformer: Language Models Can Teach Themselves to Use Tools

3058

Timothée Lacroix

Open-weight LLMs and training infrastructure

Mistral

One of the clearest people to follow for the open-weight frontier-model line, especially where Meta’s LLaMA work flows directly into Mistral’s more aggressive efficiency push.

Open Models Systems & Infrastructure Mistral AI

3059

Timothée Lottaz

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3060

Timothy Chou

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3061

Timothy Chung

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3062

Timothy Dozat

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3063

Timothy Jordan

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

3064

Timothy Lillicrap

Gemini (multimodal foundation models)

Important for the branch of DeepMind research that connects control, world models, and modern agent behavior rather than treating them as separate eras.

Multimodal Agents & Reasoning Reinforcement Learning Continuous control with deep reinforcement learning

3065

Timothy Telleen-Lawton

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Evaluation & Benchmarks Agents & Reasoning Measuring Faithfulness in Chain-of-Thought Reasoning

A useful page for the more evaluation-heavy side of Anthropic’s alignment program, especially where constitutional methods, model-written evals, and faithfulness checks start to connect.

3066

Tina Chen

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3067

Ting Yu

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

3068

Ting Zhou

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3069

TJ Lu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3070

Tobias Speckbacher

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Systems & Infrastructure xAI (site)

3071

Toby Pohlen

Frontier-scale training infrastructure

xAI

Builds core infrastructure for xAI’s frontier models.

3072

Toby Shevlane

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3073

Todor Markov

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

3074

Todor Mihaylov

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

3075

Toju Duke

Pathways-scale language modeling (PaLM)

PaLM: Scaling Language Modeling with Pathways

Co-authored PaLM: Scaling Language Modeling with Pathways.

3076

Toki Sherbakov

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

3077

Tolga Bolukbasi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3078

Tolly Powell

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

Post-Training & Alignment Reinforcement Learning Deep Reinforcement Learning from Human Preferences

3079

Tom B. Brown

Practical RL from human feedback

Co-authored Deep RL from Human Preferences: an early anchor for RLHF-style post-training.

3080

Tom Ben Gal

Hybrid Transformer–Mamba language models (Jamba)

AI21

Useful because it puts a name and a clear role on one of the engineers working at the boundary between research and implementation for AI21’s hybrid-model stack.

Systems & Infrastructure Tom Ben-Gal

3081

Tom Braude

Hybrid Transformer–Mamba language models (Jamba)

AI21

A solid head-page upgrade because it turns another thin Jamba coauthor page into a real profile tied to pre- and post-training, the part of the stack where hybrid-model behavior gets tuned into something shippable.

Post-Training & Alignment Systems & Infrastructure JAMBA: Hybrid Transformer-Mamba Language Models

3082

Tom Brown

Large-scale language modeling

Security & Robustness Language models are few-shot learners

One of the clearest researchers to study for the GPT-3 era, especially around few-shot learning, scaling behavior, and what larger language models started making possible in practice.

3083

Tom Conerly

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Reinforcement Learning In-context Learning and Induction Heads

Worth knowing because his paper trail hits several of the most useful early Anthropic threads at once: induction heads, calibration, repeated-data scaling, and the practical behavior of post-trained assistants.

3084

Tom Duerig

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3085

Tom Eccles

Generalist agents (Gato)

Multimodal Agents & Reasoning A Generalist Agent

Co-authored Gato: a key reference for generalist, multi-task agents.

3086

Tom Henighan

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Evaluation & Benchmarks Systems & Infrastructure Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

A good profile for the less public part of frontier-model progress, where pretraining quality, evaluation loops, and systems choices do a lot of the real work.

3087

Tom Hennigan

Compute-optimal scaling for LLM training

Multimodal Gemini: A Family of Highly Capable Multimodal Models

A useful page for the research layer behind DeepMind’s frontier-language-model program, especially across Gopher, Chinchilla, and Gemini.

3088

Tom Hudson

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3089

Tom Kwiatkowski

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3090

Tom Le Paine

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3091

Tom Natan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3092

Tom van der Weide

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3093

Tomas Kocisky

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

3094

Tomasz Kępa

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3095

Tomer Asida

Hybrid Transformer–Mamba language models (Jamba)

AI21

A sensible page to keep because his name appears directly on the original Jamba paper, giving users another concrete entry point into the people who built AI21’s hybrid architecture.

Systems & Infrastructure Jamba: A Hybrid Transformer-Mamba Language Model

3096

Tomer Kaftan

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

3097

Tomer Shani

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3098

Tomy Tsai

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3099

Tong Mu

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

3100

Tong Xiao

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Open Models Code Models StarCoder: may the source be with you!

3101

Tony Lee

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

3102

Travis Choma

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3103

Travis Hoppe

Open-source LLMs (EleutherAI)

EleutherAI

Worth knowing as one of the early open-data contributors around the EleutherAI orbit, with a profile that mixes work on The Pile with a long tail of small, public NLP and machine-learning experiments.

Open Models Travis Hoppe

3104

Travis Wolfe

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3105

Trevor Cai

Compute-optimal scaling for LLM training

Multimodal Systems & Infrastructure Gemini: A Family of Highly Capable Multimodal Models

A useful profile for the core DeepMind contributor layer behind Chinchilla, Gopher, and Gemini rather than only the more public faces of those systems.

3106

Trevor Killeen

Deep learning infrastructure (PyTorch)

Co-authored the PyTorch paper describing the imperative-style deep learning framework.

Open Models Systems & Infrastructure PyTorch: An Imperative Style, High-Performance Deep Learning Library

3107

Trevor Strohman

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3108

Trevor Yacovone

Open multimodal models (Gemma 3)

Open Models Multimodal Gemma 3 Technical Report

Co-authored the Gemma 3 Technical Report.

3109

Tri Dao

Efficient sequence models + attention kernels

One of the clearest researchers to follow for efficient sequence-model systems, especially the line of work that made frontier training and inference materially faster rather than merely cleaner on paper.

Systems & Infrastructure Tri Dao

3110

Tris Warkentin

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

3111

Tristan Hume

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Evaluation & Benchmarks Systems & Infrastructure Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

A useful profile for the systems side of alignment work, especially where infrastructure choices and evaluation throughput determine what a lab can actually test.

3112

Tu Vu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3113

Tulsee Doshi

Open language models (Gemma 2)