Researchers — page 6

Language Models are Few-Shot Learners (GPT-3)

Co-authored GPT-3: Language Models are Few-Shot Learners.

612

Christopher Olah

Model-written evaluations for LM behavior

Post-Training & Alignment Evaluation & Benchmarks Discovering Language Model Behaviors with Model-Written Evaluations

Co-authored model-written evals: a practical technique for discovering and measuring LM behaviors.

613

Christopher Ré

Fast, memory-efficient attention

Important because he sits at a productive seam between machine learning, data systems, and model infrastructure, with work that ranges from weak supervision to some of the most important efficiency breakthroughs in modern training stacks.

Systems & Infrastructure Homepage of Christopher Re (Chris Re)

614

Christopher Yew

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

615

Christy Koh

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

616

Chu-Cheng Lin

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

617

Chuanqi Tan

Open-weight LLMs (Qwen)

Qwen

Co-authored the Qwen Technical Report.

Open Models Qwen Technical Report

618

Chun-Sung Ferng

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

619

Chung-Cheng Chiu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

620

Chunyang Wu

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

621

Chunyu Wang

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

622

Chunyuan Li

Visual instruction tuning (LLaVA)

Co-authored Visual Instruction Tuning: a widely-cited recipe for LLaVA-style multimodal assistants.

Open Models Multimodal Post-Training & Alignment Visual Instruction Tuning

623

Cindy Wang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

624

Cip Baetu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

625

CJ Carey

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

626

CJ Weinmann

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

Systems & Infrastructure GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

627

Claire Cui

Efficient MoE scaling (GLaM)

Co-authored GLaM: an influential MoE scaling reference in large language modeling.

628

Claire Schlesinger

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

Open Models Code Models StarCoder: may the source be with you!

629

Clara Fridman

Hybrid Transformer–Mamba language models (Jamba)

Evaluation & Benchmarks Systems & Infrastructure AI21 Labs

A distinctive page in this AI21 cluster because she brings a linguistics and human-evaluation angle to model work, especially around user interaction, multilingual language behavior, and how LLM performance gets tested in practice.

630

Clara Huiyi Hu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

631

Clara Ma

Open-source tooling for modern NLP (Transformers library)

Hugging Face

Co-authored the Hugging Face Transformers paper that helped standardize modern NLP workflows.

Open Models Transformers: State-of-the-Art Natural Language Processing

632

Clara Rivera

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

633

Claudia van der Salm

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

634

Clayton Mullis

Open large-scale image-text data (LAION-5B)

Co-authored LAION-5B: a widely used open dataset for vision-language foundation models.

Multimodal LAION-5B: An open large-scale dataset for training next generation image-text models

635

Clemens Lombriser

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

636

Clemens Meyer

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

637

Clemens Winter

Code-focused LLMs and evaluation (Codex)

Evaluation & Benchmarks Code Models Evaluating Large Language Models Trained on Code

Co-authored the Codex evaluation paper: an early anchor for code LLM capability measurement.

638

Clément Crepy

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

639

Clement Delangue

Open-source tooling for modern NLP (Transformers library)

Hugging Face

Co-authored the Hugging Face Transformers paper that helped standardize modern NLP workflows.

Open Models Transformers: State-of-the-Art Natural Language Processing

640

Clément Farabet

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

641

Cody Hao Yu

Fast, cheap LLM serving (PagedAttention)

Co-authored vLLM: a widely used serving stack for efficient LLM inference.

Systems & Infrastructure vLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention

642

Colin Cherry

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

643

Colin Evans

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

644

Colin Gaffney

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

645

Colin Ji

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

646

Colin Raffel

Text-to-text transfer and pretraining (T5)

Co-authored T5: a practical template for unified NLP training and evaluation.

Evaluation & Benchmarks Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

647

Collin Burns

Broad capability evaluation (MMLU)

Co-authored MMLU: a widely used benchmark for general LLM capability across many subjects.

Evaluation & Benchmarks Measuring Massive Multitask Language Understanding

648

Colton Bishop

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

649

Connie Tao

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

650

Connor Leahy

Open models, governance, communication

EleutherAI

An important bridge figure between open-weight language-model communities and the modern alignment debate, especially when you want to understand how frontier capability, openness, and control arguments collide in practice.

Open Models Post-Training & Alignment Conjecture

651

Corby Rosset

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

652

Corinne Wong

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

653

Cormac Brick

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

654

Cory Decareaux

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

655

Cosmin Paduraru

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

656

Cosmo Du

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

657

Craig Pettit

Model-written evaluations for LM behavior

Post-Training & Alignment Evaluation & Benchmarks Discovering Language Model Behaviors with Model-Written Evaluations

Co-authored model-written evals: a practical technique for discovering and measuring LM behaviors.

658

Craig Swanson

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

659

Cristian Canton Ferrer

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

660

Crystal Nam

Open, fully-documented language models (OLMo)

AI2

Co-authored OLMo: Accelerating the Science of Language Models.

Open Models OLMo: Accelerating the Science of Language Models

661

Cullen O'Keefe

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

662

Cynthia Gao

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

663

Cyril Zhang

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

664

Cyrus Nikolaidis

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

665

D. Sculley

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

666

Da Yan

Model-written evaluations for LM behavior

Post-Training & Alignment Evaluation & Benchmarks Discovering Language Model Behaviors with Model-Written Evaluations

Co-authored model-written evals: a practical technique for discovering and measuring LM behaviors.

667

Da-Cheng Juan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

668

Da-Woon Chung

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

669

Daan Wierstra

Deep Q-Networks (DQN)

Co-authored the original DQN preprint: a core reference for deep reinforcement learning.

Reinforcement Learning Playing Atari with Deep Reinforcement Learning

670

Dacheng Li

Human preference evaluation at scale (Chatbot Arena)

Co-authored Chatbot Arena: a high-impact human-preference evaluation platform for LLMs.

Evaluation & Benchmarks Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

671

Daiyi Peng

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

672

Dale Schuurmans

Chain-of-thought prompting and reasoning

Co-authored the chain-of-thought prompting paper; foundational for modern reasoning prompting.

Agents & Reasoning Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

673

Dalia El Badawy

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

674

Damai Dai

Open-model frontier reports (DeepSeek-V3)

DeepSeek

Co-authored the DeepSeek-V3 Technical Report.

Open Models DeepSeek-V3 Technical Report

675

Damien Allonsius

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

676

Damien Deville

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

677

Damon Civin

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

678

Dan Banica

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

679

Dan Bikel

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

680

Dan Dooley

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

681

Dan Garrette

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

682

Dan Hendrycks

Broad capability evaluation (MMLU)

Co-authored MMLU: a widely used benchmark for general LLM capability across many subjects.

Evaluation & Benchmarks Measuring Massive Multitask Language Understanding

683

Dan Holtmann-Rice

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

684

Dan Horgan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

685

Dan Hurt

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

686

Dan Iter

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

687

Dan Malkin

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

688

Dan McKinnon

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

689

Dan Padnos

Hybrid Transformer–Mamba language models (Jamba)

Systems & Infrastructure Dan Padnos

A strong page for the applied side of frontier AI because his work sits closer to deployment and platform architecture than pure modeling, which makes him useful for understanding how AI21 turned model research into products other teams could actually build on.

690

Dana Beaty

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

691

Daniel Andor

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

692

Daniel Balle

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

693

Daniel Cer

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

694

Daniel Deutsch

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

695

Daniel Finchelstein

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

696

Daniel Fried

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

Open Models Code Models StarCoder: may the source be with you!

697

Daniel Gissin

Hybrid Transformer–Mamba language models (Jamba)

Systems & Infrastructure Jamba-1.5: Hybrid Transformer-Mamba Models at Scale

A stronger page than the default Jamba byline because his work clearly predates it: he has earlier papers on active learning and implicit bias in deep networks before showing up on Jamba-1.5.

698

Daniel J. Mankowitz

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

699

Daniel Jannai

Hybrid Transformer–Mamba language models (Jamba)

Systems & Infrastructure Daniel Jannai on Google Scholar

A worthwhile profile because he is tied directly to the main public Jamba releases, which makes him one of the clearer names behind the hybrid Transformer-Mamba model line rather than just another long author list entry.

700

Daniel Khashabi

Synthetic instructions for alignment (Self-Instruct)

Co-authored Self-Instruct: a key reference for instruction data generation pipelines.

Post-Training & Alignment Self-Instruct: Aligning Language Models with Self-Generated Instructions

701

Daniel Kokotajlo

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

702

Daniel Kreymer

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

703

Daniel Levy

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

704

Daniel Li

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

705

Daniel M. Ziegler

Large-scale language modeling (GPT-3)

Language Models are Few-Shot Learners (GPT-3)

Co-authored GPT-3: Language Models are Few-Shot Learners.

706

Daniel Mossing

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

707

Daniel Perez-Becker

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

708

Daniel Salz

Scaled multilingual vision-language models (PaLI)

Multimodal PaLI: A Jointly-Scaled Multilingual Language-Image Model

Co-authored PaLI: a key reference for scaling multilingual vision-language models.

709

Daniel Selsam

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

710

Daniel Sohn

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

711

Daniel Song

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

712

Daniel Toyama

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

713

Daniel von Dincklage

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

714

Daniel Y. Fu

Fast, memory-efficient attention

One of the more useful people to follow for the systems side of modern model building, especially where better kernels and sequence methods translate directly into frontier-model training and inference speed.

Systems & Infrastructure Research | Together AI

715

Daniela Amodei

Model-written evaluations for LM behavior

Post-Training & Alignment Evaluation & Benchmarks Discovering Language Model Behaviors with Model-Written Evaluations

Co-authored model-written evals: a practical technique for discovering and measuring LM behaviors.

716

Danielle Eisenbud

Open multimodal models (Gemma 3)