Researchers — page 2

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

132

Alex Vaughan

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

133

Alexander Kirillov

Promptable segmentation foundation models (SAM)

Vision & Robotics Segment Anything

Co-authored Segment Anything.

134

Alexander Kolesnikov

Vision Transformers (ViT)

Co-authored ViT: a turning point for transformers in vision.

Vision & Robotics An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

135

Alexander Neitz

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

136

Alexander Novikov

Generalist agents (Gato)

Multimodal Agents & Reasoning A Generalist Agent

Co-authored Gato: a key reference for generalist, multi-task agents.

137

Alexander Pritzel

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

138

Alexander Spiridonov

Pathways-scale language modeling (PaLM)

PaLM: Scaling Language Modeling with Pathways

Co-authored PaLM: Scaling Language Modeling with Pathways.

139

Alexander Wettig

Measuring real-world coding ability (SWE-bench)

Co-authored SWE-bench: a key benchmark for whether models can resolve real GitHub issues.

Evaluation & Benchmarks Agents & Reasoning SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

140

Alexandre Défossez

Open foundation models for code (Code Llama)

Open Models Code Models Code Llama: Open Foundation Models for Code

Co-authored Code Llama: a key open-model reference for code generation and coding assistants.

141

Alexandre Frechette

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

142

Alexandre Moufarek

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

143

Alexandre Ramé

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

144

Alexandre Sablayrolles

Mixture-of-experts LLMs

Mistral

Useful because his work connects earlier privacy and representation-learning research to some of Mistral’s most important open-weight model releases.

Open Models Systems & Infrastructure Mistral 7B

145

Alexei Baevski

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

146

Alexei Bendebury

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

147

Alexei Robsky

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

148

Alexey Dosovitskiy

Vision Transformers (ViT)

Co-authored ViT: a turning point for transformers in vision.

Vision & Robotics An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

149

Alexey Guseynov

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

150

Alfonso Castaño

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

151

Ali Eichenbaum

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

152

Ali Elqursh

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

153

Ali Farhadi

Commonsense reasoning evaluation (HellaSwag)

Co-authored HellaSwag: a widely used commonsense benchmark for language understanding.

Evaluation & Benchmarks Agents & Reasoning HellaSwag: Can a Machine Really Finish Your Sentence?

154

Ali Ghorbani

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

155

Ali Ibrahim

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

156

Ali Kamali

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

157

Ali Mahmoudzadeh

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

158

Ali Razavi

Generalist agents (Gato)

Multimodal Agents & Reasoning A Generalist Agent

Co-authored Gato: a key reference for generalist, multi-task agents.

159

Aliaksei Severyn

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

160

Alice Talbert

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

161

Alicia Parrish

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

162

Alina Oprea

Training-data extraction and privacy risks

Co-authored Extracting Training Data from Large Language Models: a core paper on memorization and extraction risk.

Security & Robustness Extracting Training Data from Large Language Models

163

Alireza Ghaffarkhah

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

164

Alisa Liu

Synthetic instructions for alignment (Self-Instruct)

Co-authored Self-Instruct: a key reference for instruction data generation pipelines.

Post-Training & Alignment Self-Instruct: Aligning Language Models with Self-Generated Instructions

165

Alison Reid

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

166

Aliya Ahmad

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

167

Allan Dafoe

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

168

Allen Hutchison

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

169

Allie Del Giorno

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

170

Allie Feinstein

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

171

Alon Albalak

RWKV and efficient sequence modeling

A strong open-model and data-centric page because his work sits close to the infrastructure that made OLMo and Dolma useful to the broader research community rather than just another benchmark-driven model release.

Open Models Evaluation & Benchmarks Systems & Infrastructure Alon Albalak

172

Alon Benhaim

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

173

Alvin Abdagic

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

174

Alvin Wang

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

175

Amanda Askell

Alignment, behavior shaping, safety

Anthropic

A high-signal researcher for understanding how post-training and behavioral steering become concrete product behavior rather than abstract alignment talk.

Post-Training & Alignment Reinforcement Learning Claude's Constitution

176

Amanda Carl

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

177

Amanda Kallet

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

178

Amar Subramanya

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

179

Ambrose Slone

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

180

Amelia Glaese

Gemini (multimodal foundation models)

A useful researcher to follow if you care about the bridge between safety evaluation, human data, and how frontier models are turned into practical tools and benchmarks.

Multimodal Evaluation & Benchmarks Agents & Reasoning Monitoring Monitorability

181

Amélie Héliou

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

182

Amin Saied

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

183

Amin Tootoonchian

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

184

Amir Bergman

Hybrid Transformer–Mamba language models (Jamba)

AI21

A useful systems-facing page because it ties one of the less-public engineers on the Jamba line to the practical work of turning hybrid-model research into shipped model releases.

Systems & Infrastructure AI21 Labs

185

Amir Globerson

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

186

Amit Bahree

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

187

Amit Garg

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

188

Amit Marathe

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

189

Amit Raul

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

190

Amit Sangani

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

191

Amit Vadi

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

192

Amjad Almahairi

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

193

Ammar Ahmad Awan

Large-scale transformer inference (DeepSpeed)

Co-authored DeepSpeed Inference: practical inference optimizations for serving large transformer models.

Systems & Infrastructure DeepSpeed Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale

194

Amol Mandhane

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

195

Amos Teo

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

196

Amruta Muthal

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

197

Amy Shen

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

198

Amy Yang

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

199

An Yang

Open-weight LLMs (Qwen)

Qwen

Co-authored the Qwen Technical Report.

Open Models Qwen Technical Report

200

Anaïs White

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

201

Anam Yunus

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

202

Anand Gokulchandran

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

203

Anand Iyer

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

204

Anand Rao

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

205

Ananth Agarwal

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

206

Ananya Harsh Jha

Open, fully-documented language models (OLMo)

AI2

Co-authored OLMo: Accelerating the Science of Language Models.

Open Models OLMo: Accelerating the Science of Language Models

207

Ananya Kumar

Holistic evaluation of language models (HELM)

Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

Evaluation & Benchmarks Holistic Evaluation of Language Models

208

Anastasia Petrushkina

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

209

Anastasios Nikolas Angelopoulos

Human preference evaluation at scale (Chatbot Arena)

Co-authored Chatbot Arena: a high-impact human-preference evaluation platform for LLMs.

Evaluation & Benchmarks Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

210

Anca Dragan

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

211

Anca Stefanoiu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

212

Anders Andreassen

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

213

András György

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

214

Andras Orban

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

215

André Susano Pinto

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

216

Andrea Hu

Open code models (CodeGemma)

Open Models Code Models CodeGemma: Open Code Models Based on Gemma

Co-authored CodeGemma: open code models based on Gemma.

217

Andrea Siciliano

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

218

Andrea Tacchetti

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

219

Andrea Tupini

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

220

Andrea Vallone

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

221

Andreas Blattmann

Latent diffusion for high-res generation

Co-authored Latent Diffusion Models: the foundation behind Stable Diffusion-style pipelines.

Diffusion & Generative Media High-Resolution Image Synthesis with Latent Diffusion Models

222

Andreas Fidjeland

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

223

Andreas Köpf

Deep learning infrastructure (PyTorch)

Co-authored the PyTorch paper describing the imperative-style deep learning framework.

Open Models Systems & Infrastructure PyTorch: An Imperative Style, High-Performance Deep Learning Library

224

Andreas Santucci

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

225

Andreas Steiner

Scaled multilingual vision-language models (PaLI)

Multimodal PaLI: A Jointly-Scaled Multilingual Language-Image Model

Co-authored PaLI: a key reference for scaling multilingual vision-language models.

226

Andrei Lupu

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

227

Andrei Sozanschi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

228

Andrej Karpathy

Deep learning engineering, LLM education

Important not only for his direct research contributions, but for translating frontier deep-learning ideas into builder intuition that spreads across the industry.

Andrej Karpathy

229

Andres Alvarado

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

230

Andrew Brock

Few-shot vision-language models (Flamingo)

Multimodal Flamingo: a Visual Language Model for Few-Shot Learning

Co-authored Flamingo: an influential multimodal model for few-shot vision-language tasks.

231

Andrew Cann

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

232

Andrew Caples

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

233

Andrew Goodman

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

234

Andrew Gu

Fully Sharded Data Parallel training (FSDP)

Co-authored PyTorch FSDP: practical lessons for scaling fully-sharded training workloads.

Systems & Infrastructure PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel

235

Andrew Ho

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

236

Andrew Ilyas

Adversarial robustness and feature learning

Co-authored “Adversarial Examples Are Not Bugs, They Are Features”.

Post-Training & Alignment Security & Robustness Adversarial Examples Are Not Bugs, They Are Features

237

Andrew Kondrich

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

238

Andrew Lee

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

239

Andrew M. Dai

Gemini (multimodal foundation models)

A good researcher to follow for the infrastructure side of frontier language models, especially mixture-of-experts scaling, instruction tuning, and the data systems that make very large models usable.

Multimodal Post-Training & Alignment Systems & Infrastructure More Efficient In-Context Learning with GLaM

240

Andrew Mayne

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.