Researchers — page 3

One of the earlier Anthropic contributors worth tracking if you care about the transition from RLHF-style assistant training into scaling and evaluation work.

Post-Training & Alignment Evaluation & Benchmarks Reinforcement Learning Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

253

Andy Zou

Broad capability evaluation (MMLU)

Co-authored MMLU: a widely used benchmark for general LLM capability across many subjects.

Evaluation & Benchmarks Measuring Massive Multitask Language Understanding

254

Anelia Angelova

Scaled multilingual vision-language models (PaLI)

Google

Co-authored PaLI: a key reference for scaling multilingual vision-language models.

Multimodal PaLI: A Jointly-Scaled Multilingual Language-Image Model

255

Angela Fan

Open-weight chat and foundation models (Llama 2)

Meta

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

256

Angela Jiang

Frontier model development (GPT-4)

OpenAI

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

GPT-4 Technical Report

257

Angeliki Lazaridou

Gemini (multimodal foundation models)

A high-signal researcher for grounded language and retrieval-heavy systems, especially if you want to understand how language models stay useful as the world changes around them.

Multimodal Evaluation & Benchmarks Systems & Infrastructure Angeliki Lazaridou

258

Angelos Filos

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

259

Anh Nguyen

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

260

Anhad Mohananey

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

261

Anil Das

Open multimodal models (Gemma 3)

Google

Co-authored the Gemma 3 Technical Report.

Open Models Multimodal Gemma 3 Technical Report

262

Anirudh Baddepudi

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

263

Anirudh Goyal

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

264

Anish Thite

Open-source LLMs (EleutherAI)

EleutherAI

Useful to follow if you care about the practical evaluation layer of open models, especially where benchmark tooling and reproducible comparisons actually shape what the ecosystem measures.

Open Models Evaluation & Benchmarks Anish Thite

265

Anita Gergely

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

266

Anitha Vijayakumar

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

267

Anja Hauth

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

268

Ankesh Anand

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

269

Ankit Ramchandani

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

270

Ankur Bapna

Open multimodal models (Gemma 3)

Google

Co-authored the Gemma 3 Technical Report.

Open Models Multimodal Gemma 3 Technical Report

271

Ankur Garg

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

272

Ankush Garg

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

273

Anmol Gulati

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

274

Anna Bortsova

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

275

Anna Bulanova

Open language models from Google (Gemma)

Google

Co-authored Gemma: open models based on Gemini research and technology.

Open Models Gemma: Open Models Based on Gemini Research and Technology

276

Anna Chen

Alignment via AI feedback (Constitutional AI)

Anthropic

A high-signal person to follow for the part of alignment research that asks whether a model’s stated reasoning can actually be trusted and measured.

Post-Training & Alignment Agents & Reasoning Reinforcement Learning Question Decomposition Improves the Faithfulness of Model-Generated Reasoning

277

Anna Goldie

Alignment via AI feedback (Constitutional AI)

Anthropic

A strong person to follow for the point where machine learning research starts shaping the compute stack itself, especially in chip placement and systems-aware optimization.

Multimodal Post-Training & Alignment Systems & Infrastructure How AlphaChip transformed computer chip design

278

Anna Makanju

Frontier model development (GPT-4)

OpenAI

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

GPT-4 Technical Report

279

Anna-Luisa Brakman

Frontier model development (GPT-4)

OpenAI

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

GPT-4 Technical Report

280

Annie Dong

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

281

Annie Franco

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

282

Annie Louis

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

283

Anoop Sinha

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

284

Anselm Levskaya

Pathways-scale language modeling (PaLM)

Google

Co-authored PaLM: Scaling Language Modeling with Pathways.

PaLM: Scaling Language Modeling with Pathways

285

Ante Kärrman

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

286

Anthony Chen

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

287

Anthony Hartshorn

Open-weight chat and foundation models (Llama 2)

Meta

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

288

Anthony Laforge

Open language models (Gemma 2)

Google

Co-authored Gemma 2: improving open language models at a practical size.

Open Models Gemma 2: Improving Open Language Models at a Practical Size

289

Anthony Moi

Open-source tooling for modern NLP (Transformers library)

Hugging Face

Co-authored the Hugging Face Transformers paper that helped standardize modern NLP workflows.

Open Models Transformers: State-of-the-Art Natural Language Processing

290

Anthony Urbanowicz

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

291

Anthony Yu

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

292

Antoine He

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

293

Antoine Miech

Few-shot vision-language models (Flamingo)

Google DeepMind

Co-authored Flamingo: an influential multimodal model for few-shot vision-language tasks.

Multimodal Flamingo: a Visual Language Model for Few-Shot Learning

294

Antoine Roux

Mixture-of-experts LLMs

Mistral

Co-authored Mixtral of Experts: a key MoE reference in the open-weights frontier.

Open Models Systems & Infrastructure Mixtral of Experts

295

Antoine Yang

Open multimodal models (Gemma 3)

Google

Co-authored the Gemma 3 Technical Report.

Open Models Multimodal Gemma 3 Technical Report

296

Anton Älgmyr

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

297

Anton Briukhov

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

298

Anton Tsitsulin

Open language models (Gemma 2)

Google

Co-authored Gemma 2: improving open language models at a practical size.

Open Models Gemma 2: Improving Open Language Models at a Practical Size

299

Antonia Paterson

Open language models from Google (Gemma)

Google

Co-authored Gemma: open models based on Gemini research and technology.

Open Models Gemma: Open Models Based on Gemini Research and Technology

300

Antonio Sanchez

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

301

Antonio Stella

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

302

Anudhyan Boral

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

303

Anuj Goyal

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

304

Anuj Khare

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

305

Aobo Yang

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

306

Aparajita Saraf

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

307

Aran Komatsuzaki

Open-source LLMs (EleutherAI)

EleutherAI

An important open-model researcher for understanding how early public LLM efforts, scaling heuristics, and open data work fed into the broader modern model ecosystem.

Open Models About Me – Aran Komatsuzaki

308

Arash Bakhtiari

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

309

Archi Mitra

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

310

Archie Sravankumar

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

311

Archit Sharma

Direct preference optimization (DPO)

A useful person to follow for the bridge between reinforcement-learning instincts and later alignment methods like DPO, especially where preference optimization is treated as a core learning problem rather than a bolt-on finetuning trick.

Post-Training & Alignment Evaluation & Benchmarks Reinforcement Learning Direct Preference Optimization: Your Language Model is Secretly a Reward Model

312

Ari Holtzman

Efficient finetuning of quantized LLMs

Important because he helped define how people think about language-model decoding quality, and his work keeps showing up where practical generation behavior matters more than benchmark theater.

Evaluation & Benchmarks Systems & Infrastructure Ari Holtzman – Department of Computer Science

313

Ariel Herbert-Voss

Large-scale language modeling (GPT-3)

OpenAI

Co-authored GPT-3: Language Models are Few-Shot Learners.

Language Models are Few-Shot Learners (GPT-3)

314

Ariel Stolovich

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

315

Arindam Mitra

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

316

Aris Konstantinidis

Frontier model development (GPT-4)

OpenAI

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

GPT-4 Technical Report

317

Arjun Guha

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

Open Models Code Models StarCoder: may the source be with you!

318

Arka Dhar

Frontier model development (GPT-4)

OpenAI

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

GPT-4 Technical Report

319

Arkabandhu Chowdhury

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

320

Arman Cohan

Open, fully-documented language models (OLMo)

AI2

Co-authored OLMo: Accelerating the Science of Language Models.

Open Models OLMo: Accelerating the Science of Language Models

321

Armand Joulin

Open-weight foundation models (LLaMA)

Meta

A strong bridge figure between the older fastText and self-supervision era and the newer open-weight LLaMA wave at Meta.

Open Models Systems & Infrastructure Bag of Tricks for Efficient Text Classification

322

Armel Zebaze

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

Open Models Code Models StarCoder: may the source be with you!

323

Aroma Mahendru

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

324

Arpi Vezer

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

325

Artem Korenev

Open-weight chat and foundation models (Llama 2)

Meta

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

326

Arthur Bražinskas

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

327

Arthur Guez

Self-play RL with search (AlphaZero)

Google DeepMind

Co-authored AlphaZero: a canonical reference for combining self-play with search in RL.

Reinforcement Learning Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

328

Arthur Hinsvark

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

329

Arthur Mensch

Open-weight LLMs

Mistral

One of the clearest people to track if you want to understand how frontier open-weight labs balance model quality, deployment speed, and product ambition.

Open Models Mistral AI

330

Artidoro Pagnoni

Efficient finetuning of quantized LLMs

Co-authored QLoRA: made high-quality fine-tuning feasible on modest hardware.

Systems & Infrastructure QLoRA: Efficient Finetuning of Quantized LLMs

331

Artyom Kozhevnikov

Open foundation models for code (Code Llama)

Meta

Co-authored Code Llama: a key open-model reference for code generation and coding assistants.

Open Models Code Models Code Llama: Open Foundation Models for Code

332

Arun Ahuja

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

333

Arun Kishore

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

334

Arun Rao

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

335

Arun Vijayvergiya

Frontier model development (GPT-4)

OpenAI

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

GPT-4 Technical Report

336

Arvind Neelakantan

Large-scale language modeling (GPT-3)

OpenAI

Co-authored GPT-3: Language Models are Few-Shot Learners.

Language Models are Few-Shot Learners (GPT-3)

337

Ashish Sabharwal

Science QA evaluation (ARC)

Co-authored ARC: an influential reasoning benchmark for question answering.

Evaluation & Benchmarks Agents & Reasoning Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

338

Ashish Shenoy

Open multimodal models (Gemma 3)

Google

Co-authored the Gemma 3 Technical Report.

Open Models Multimodal Gemma 3 Technical Report

339

Ashish Thapliyal

Scaled multilingual vision-language models (PaLI)

Google

Co-authored PaLI: a key reference for scaling multilingual vision-language models.

Multimodal PaLI: A Jointly-Scaled Multilingual Language-Image Model

340

Ashish Vaswani

Transformers

A foundational figure in modern sequence modeling whose work on the Transformer changed the technical direction of language and multimodal systems.

Multimodal Systems & Infrastructure Attention Is All You Need

341

Ashley Edwards

Generalist agents (Gato)

Google DeepMind

Co-authored Gato: a key reference for generalist, multi-task agents.

Multimodal Agents & Reasoning A Generalist Agent

342

Ashley Gabriel

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

343

Ashley Pantuliano

Frontier model development (GPT-4)

OpenAI

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

GPT-4 Technical Report

344

Ashvin Nair

Frontier model development (GPT-4)

OpenAI

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

GPT-4 Technical Report

345

Ashwin Bharambe

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

346

Ashwin Gopinath

Self-reflection loops for LLM agents (Reflexion)

Co-authored Reflexion: a practical pattern for improving agents via self-critique and memory.

Systems & Infrastructure Agents & Reasoning Reinforcement Learning Reflexion: Language Agents with Verbal Reinforcement Learning

347

Ashwin Sethi

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

348

Ashwin Sreevatsa

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

349

Asier Mujika

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

350

Assaf Eisenman

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

351

Assaf Israel

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

352

Aston Zhang

Open-weight frontier models (Llama 3)

Meta

Co-authored “The Llama 3 Herd of Models”.

Open Models The Llama 3 Herd of Models

353

Atharva Parulekar

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

354

Atri Rudra

Fast, memory-efficient attention

Worth following because he brings a real theory background into the model-systems layer, especially where structured linear algebra and sequence methods end up mattering for practical modern architectures.

Systems & Infrastructure Atri Rudra

355

Atsushi Saito

RWKV and efficient sequence modeling

Useful because he connects an earlier line of conversational-AI work at Nextremer with later authorship on both the original RWKV paper and Eagle/Finch.

Open Models Systems & Infrastructure Reinforcement Learning Curriculum Learning Based on Reward Sparseness for Deep Reinforcement Learning of Task Completion Dialogue Management

356

Attila Dankovics

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

357

Atty Eleti

Frontier model development (GPT-4)

OpenAI

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

GPT-4 Technical Report

358

Aurelia Guy

Compute-optimal scaling for LLM training

Google DeepMind

A useful profile for the research layer behind DeepMind’s large-model program, especially the line of work running from Gopher and Chinchilla into Gemini.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

359

Aurelien Boffy

Multimodal frontier models (Gemini)

Google DeepMind

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

Multimodal Gemini: A Family of Highly Capable Multimodal Models

360

Aurelien Rodriguez

Open-weight foundation models (LLaMA)

Meta

Useful to follow for the scaling and productization layer of the LLaMA line, especially as it moved from the first paper into the broader Llama 3 release wave.

Open Models Systems & Infrastructure LLaMA: Open and Efficient Foundation Language Models