Researchers — page 10

Co-authored “The Llama 3 Herd of Models”.

1083

Guillem Cucurull

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

1084

Guillermo Garrido

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1085

Guna Lakshminarayanan

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1086

Guodong Zhang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1087

Guolong Su

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1088

Guowei Li

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

Post-Training & Alignment Evaluation & Benchmarks Discovering Language Model Behaviors with Model-Written Evaluations

1089

Guro Khundadze

Model-written evaluations for LM behavior

Anthropic

Co-authored model-written evals: a practical technique for discovering and measuring LM behaviors.

1090

Gus Martins

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

1091

Gustavo de Rosa

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

1092

Guy Gur-Ari

Pathways-scale language modeling (PaLM)

PaLM: Scaling Language Modeling with Pathways

Co-authored PaLM: Scaling Language Modeling with Pathways.

1093

H. Zhang

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1094

Hadi Hashemi

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

1095

Hafeezul Rahman Mohammad

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1096

Hagai Taitelbaum

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1097

Hailey Nguyen

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Open Models Code Models StarCoder: may the source be with you!

1098

Hailey Schoelkopf

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

1099

Haim Rozenblum

Hybrid Transformer–Mamba language models (Jamba)

AI21

A better page than the generic research stub because it surfaces the product and backend engineering layer that supports AI21's model work, not just the research papers themselves.

Systems & Infrastructure Haim Rozenblum

1100

Haiming Bao

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

1101

Haiping Wu

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

1102

Hakan Inan

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

1103

Hamid Moghaddam

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1104

Hamid Shojanazeri

Fully Sharded Data Parallel training (FSDP)

Co-authored PyTorch FSDP: practical lessons for scaling fully-sharded training workloads.

Systems & Infrastructure PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel

1105

Hamish Ivison

Open, fully-documented language models (OLMo)

AI2

Co-authored OLMo: Accelerating the Science of Language Models.

Open Models OLMo: Accelerating the Science of Language Models

1106

Han Bao

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1107

Han Lu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1108

Han Zhang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1109

Han Zou

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1110

Hanna Klimczak-Plucińska

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

1111

Hannah Forbes

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1112

Hannah Korevaar

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1113

Hannah Sheahan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1114

Hannah Wang

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1115

Hannah Wong

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

Post-Training & Alignment Self-Instruct: Aligning Language Models with Self-Generated Instructions

1116

Hannaneh Hajishirzi

Synthetic instructions for alignment (Self-Instruct)

Co-authored Self-Instruct: a key reference for instruction data generation pipelines.

1117

Hansa Srinivasan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1118

Hanwei Xu

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1119

Hanwen Zha

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

1120

Hany Awadalla

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

1121

Hanzhao Lin

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1122

Hanzi Mao

Promptable segmentation foundation models (SAM)

Vision & Robotics Segment Anything

Co-authored Segment Anything.

1123

Hao Cheng

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

1124

Hao Wu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1125

Hao Yang

Open-weight LLMs (Qwen)

Open Models Qwen Technical Report

Co-authored the Qwen Technical Report.

1126

Hao Zhang

Fast, cheap LLM serving (PagedAttention)

Co-authored vLLM: a widely used serving stack for efficient LLM inference.

Systems & Infrastructure vLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention

1127

Hao Zhou

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1128

Haocheng Wang

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1129

Haojie Wei

Audio-capable open models (Qwen2-Audio)

Open Models Multimodal Qwen2-Audio Technical Report

Co-authored the Qwen2-Audio Technical Report.

1130

Haoran Wei

Open-weight LLMs (Qwen2)

Open Models Qwen2 Technical Report

Co-authored the Qwen2 Technical Report.

1131

Haotian Liu

Visual instruction tuning (LLaVA)

Co-authored Visual Instruction Tuning: a widely-cited recipe for LLaVA-style multimodal assistants.

Open Models Multimodal Post-Training & Alignment Visual Instruction Tuning

1132

Haowei Zhang

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

Open Models Systems & Infrastructure RWKV: Reinventing RNNs for the Transformer Era

1133

Haowen Hou

RWKV and efficient sequence modeling

A useful RWKV page because he is present on the original paper, Eagle/Finch, and RWKV-7, making him part of the smaller set of contributors who stayed with the architecture as it evolved rather than only appearing at launch.

1134

Haozhun Jin

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

1135

Hardie Cate

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1136

Hardik Modi

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

1137

Harish Ganapathy

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1138

Harkirat Behl

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

1139

Harleen Batra

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

1140

Harm de Vries

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

Open Models Code Models StarCoder: may the source be with you!

1141

Harman Singh

Open multimodal models (Gemma 3)

Open Models Multimodal Gemma 3 Technical Report

Co-authored the Gemma 3 Technical Report.

1142

Haroon Qureshi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1143

Haroun Habeeb

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1144

Harri Edwards

Code-focused LLMs and evaluation (Codex)

Evaluation & Benchmarks Code Models Evaluating Large Language Models Trained on Code

Co-authored the Codex evaluation paper: an early anchor for code LLM capability measurement.

1145

Harrison Rudolph

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1146

Harry Askham

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1147

Harsh Dhand

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

1148

Harsh Mehta

Open multimodal models (Gemma 3)

Open Models Multimodal Gemma 3 Technical Report

Co-authored the Gemma 3 Technical Report.

1149

Harsha Vashisht

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1150

Harshal Godhia

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1151

Harshal Tushar Lehri

Open multimodal models (Gemma 3)

Open Models Multimodal Gemma 3 Technical Report

Co-authored the Gemma 3 Technical Report.

1152

Hassan Akbari

Scaled multilingual vision-language models (PaLI)

Multimodal PaLI: A Jointly-Scaled Multilingual Language-Image Model

Co-authored PaLI: a key reference for scaling multilingual vision-language models.

1153

Hayden Lau

RWKV and efficient sequence modeling

Co-authored RWKV: Reinventing RNNs for the Transformer Era.

Open Models Systems & Infrastructure RWKV: Reinventing RNNs for the Transformer Era

1154

Heather Schmidt

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

1155

Héctor Fernández Alcalde

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1156

Heewoo Jun

Code-focused LLMs and evaluation (Codex)

Evaluation & Benchmarks Code Models Evaluating Large Language Models Trained on Code

Co-authored the Codex evaluation paper: an early anchor for code LLM capability measurement.

1157

Heidi Howard

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1158

Heidy Khlaaf

Code-focused LLMs and evaluation (Codex)

Evaluation & Benchmarks Code Models Evaluating Large Language Models Trained on Code

Co-authored the Codex evaluation paper: an early anchor for code LLM capability measurement.

1159

Heinrich Jiang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1160

Heinrich Küttler

Retrieval-augmented generation (RAG)

Co-authored RAG: a canonical reference for retrieval-augmented generation in NLP.

Evaluation & Benchmarks Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

1161

Helen Miller

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1162

Helen Suk

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1163

Heng-Tze Cheng

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1164

Henri Roussez

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

1165

Henrik Jacobsson

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1166

Henrique Ponde de Oliveira Pinto

Code-focused LLMs and evaluation (Codex)

Evaluation & Benchmarks Code Models Evaluating Large Language Models Trained on Code

Co-authored the Codex evaluation paper: an early anchor for code LLM capability measurement.

1167

Henry Aspegren

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1168

Henryk Michalewski

Pathways-scale language modeling (PaLM)

PaLM: Scaling Language Modeling with Pathways

Co-authored PaLM: Scaling Language Modeling with Pathways.

1169

Heri Zhao

Open code models (CodeGemma)

Open Models Code Models CodeGemma: Open Code Models Based on Gemma

Co-authored CodeGemma: open code models based on Gemma.

1170

Hervé Jégou

Self-supervised vision transformers (DINO)

Co-authored DINO: influential self-supervised representation learning for vision transformers.

Vision & Robotics Emerging Properties in Self-Supervised Vision Transformers

1171

Hexiang Hu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1172

Heyang Qin

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

1173

Hila Noga

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1174

Himadri Choudhury

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1175

Himanshu Gupta

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1176

Hiteshi Sharma

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

1177

Hofit Bata

Hybrid Transformer–Mamba language models (Jamba)

AI21

A useful page because it points to the research-and-strategy side of AI21 rather than only the product or engineering side, especially where model evaluation and new architectural bets get shaped at the CTO-office level.

Evaluation & Benchmarks Systems & Infrastructure Hofit Bata

1178

Honghui Ding

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1179

Hongkun Yu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1180

Honglong Cai

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1181

Hongyi Yuan

Open-weight LLMs (Qwen)

Open Models Qwen Technical Report

Co-authored the Qwen Technical Report.

1182

Hongyu Ren

Holistic evaluation of language models (HELM)

Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

Evaluation & Benchmarks Holistic Evaluation of Language Models

1183

Hongyuan Zhan

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1184

Hongzhi Shi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1185

Horace He

Open-source LLMs (EleutherAI)

EleutherAI

One of the best people to track if you care about the practical performance layer of modern AI systems, especially where compilers, kernels, and model-serving speed actually move the frontier.

Open Models Systems & Infrastructure Horace He

1186

Hu Xu

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1187

Huajian Xin

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1188

Huan Lin

Open-weight LLMs (Qwen2)

Open Models Qwen2 Technical Report

Co-authored the Qwen2 Technical Report.

1189

Huanjie Zhou

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1190

Huanqi Cao

RWKV and efficient sequence modeling

Useful because it turns an otherwise thin RWKV byline into a real systems profile: after the original paper, his public work tracks toward large-scale pretraining infrastructure, pipeline parallelism, and systems support for frontier-scale models.

Open Models Systems & Infrastructure Huanqi Cao at Tsinghua University

1191

Huaxiu Yao

Holistic evaluation of language models (HELM)

Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

Evaluation & Benchmarks Holistic Evaluation of Language Models

1192

Huazuo Gao

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1193

Hugo Touvron

Open-weight foundation models (LLaMA)

Open Models Systems & Infrastructure Hugo Touvron at Meta

One of the cleaner bridge figures between the vision-transformer era and the open-weight LLaMA era: his public paper trail runs from influential self-supervised vision work into the first LLaMA release, Llama 2, and Code Llama.

1194

Hui Li

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1195

Hui Qu

Open-model frontier reports (DeepSeek-V3)