Researchers — page 27

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3123

Uriya Pumerantz

Hybrid Transformer–Mamba language models (Jamba)

AI21

A strong long-tail page for the AI21 cluster because it surfaces one of the algorithm developers behind the Jamba line instead of collapsing all of that work into a single undifferentiated author list.

Systems & Infrastructure Uriya Pumerantz

3124

Urvashi Bhattacharyya

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

Open Models Code Models StarCoder: may the source be with you!

3125

Urvashi Khandelwal

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3126

USVSN Sai Prashanth

Open-source LLMs (EleutherAI)

EleutherAI

A worthwhile long-tail open-model page because it captures one of the quieter GPT-NeoX contributors with an explicit EleutherAI paper trail instead of leaving the profile as a generic coauthor stub.

Open Models GPT-NeoX-20B: An Open-Source Autoregressive Language Model

3127

Utku Evci

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

3128

Vahab Mirrokni

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

3129

Valentin Anklin

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3130

Valentina Pyatkin

Open, fully-documented language models (OLMo)

AI2

Co-authored OLMo: Accelerating the Science of Language Models.

Open Models OLMo: Accelerating the Science of Language Models

3131

Valerie Balcom

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

3132

Vamsi Bedapudi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3133

Varun Godbole

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3134

Varun Vontimitta

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3135

Vedant Misra

Code-focused LLMs and evaluation (Codex)

Evaluation & Benchmarks Code Models Evaluating Large Language Models Trained on Code

Co-authored the Codex evaluation paper: an early anchor for code LLM capability measurement.

3136

Vedanuj Goswami

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

3137

Venus Wang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3138

Vera Filippova

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3139

Vered Cohen

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3140

Vibhor Gupta

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3141

Victor Ähdel

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3142

Víctor Campos Campos

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3143

Victor Cotruta

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

3144

Victor Fragoso

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

3145

Victor Sanh

Open-source tooling for modern NLP (Transformers library)

Hugging Face

Co-authored the Hugging Face Transformers paper that helped standardize modern NLP workflows.

Open Models Transformers: State-of-the-Art Natural Language Processing

3146

Victoria Ajayi

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3147

Victoria Krakovna

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3148

Victoria Montanez

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3149

Vignesh Ramanathan

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3150

Vihan Jain

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

3151

Vijai Mohan

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3152

Vijay Bolina

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3153

Vijay Vasudevan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3154

Vik Goel

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

3155

Vikas Peswani

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3156

Vikas Yadav

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

3157

Vikram Rao

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3158

Viktor Kerkez

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

3159

Vilobh Meshram

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

3160

Vinay Ramasesh

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3161

Vinay Satish Kumar

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3162

Vincent Gonguet

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3163

Vincent Hellendoorn

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3164

Vincent Roseberry

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

Post-Training & Alignment Finetuned Language Models Are Zero-Shot Learners

3165

Vincent Y. Zhao

Instruction tuning for better zero-shot behavior

Co-authored FLAN: a practical anchor for instruction tuning and zero-shot transfer.

3166

Vineet Kosaraju

Grade-school math reasoning (GSM8K)

Co-authored GSM8K: a core benchmark/dataset for math word problems and verification.

Evaluation & Benchmarks Agents & Reasoning Training Verifiers to Solve Math Word Problems (GSM8K)

3167

Vineet Shah

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3168

Vinnie Monaco

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

3169

Vinod Koverkathu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3170

Vinodkumar Prabhakaran

Pathways-scale language modeling (PaLM)

PaLM: Scaling Language Modeling with Pathways

Co-authored PaLM: Scaling Language Modeling with Pathways.

3171

Vinu Rajashekhar

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3172

Vipul Ranjan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3173

Virginie Do

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3174

Vish Vogeti

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3175

Vishal Dharmadhikari

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

3176

Vishal Kuo

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

3177

Vishal Mangla

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3178

Vishal Verma

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3179

Vishrav Chaudhary

Holistic evaluation of language models (HELM)

Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

Evaluation & Benchmarks Holistic Evaluation of Language Models

3180

Vít Listík

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3181

Vitaliy Nikolaev

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3182

Vitaly Gatsko

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3183

Vitchyr H. Pong

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

3184

Vítor Albiero

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3185

Vittorio Selo

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3186

Vivaan Bhatia

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3187

Vlad Feinberg

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

3188

Vlad Firoiu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3189

Vlad Ionescu

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3190

Vlad Kolesnikov

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

3191

Vlad Poenaru

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3192

Vlad Tiberiu Mihailescu

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3193

Vladan Petrovic

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3194

Vladimir Feinberg

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3195

Vladimir Ivanov

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Evaluation & Benchmarks Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

3196

Vladimir Karpukhin

Retrieval-augmented generation (RAG)

Co-authored RAG: a canonical reference for retrieval-augmented generation in NLP.

3197

Vladimir Mikulik

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3198

Volodymyr Mnih

Deep Q-Networks (DQN)

Co-authored the original DQN preprint: a core reference for deep reinforcement learning.

Reinforcement Learning Playing Atari with Deep Reinforcement Learning

3199

W. L. Xiao

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

3200

Wade Hickey

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

3201

Wael Farhan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3202

Wan-Yen Lo

Promptable segmentation foundation models (SAM)

Vision & Robotics Segment Anything

Co-authored Segment Anything.

3203

Wang

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3204

Wangding Zeng

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

3205

Wanjia Zhao

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

3206

Wanming Chen

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3207

Warren Barkley

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

3208

Warren Chen

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3209

Wei An

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

3210

Wei Fan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3211

Wei Li

Text-to-text transfer and pretraining (T5)

Co-authored T5: a practical template for unified NLP training and evaluation.

Evaluation & Benchmarks Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

3212

Wei Wang

Open-weight LLMs (Qwen)

Qwen

Co-authored the Qwen Technical Report.

Open Models Qwen Technical Report

3213

Wei Wei

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

3214

Wei-Lin Chiang

Human preference evaluation at scale (Chatbot Arena)

Co-authored Chatbot Arena: a high-impact human-preference evaluation platform for LLMs.

Evaluation & Benchmarks Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

3215

Weicheng Kuo

Scaled multilingual vision-language models (PaLI)

Multimodal PaLI: A Jointly-Scaled Multilingual Language-Image Model

Co-authored PaLI: a key reference for scaling multilingual vision-language models.

3216

Weijian Xu

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

3217

Weiren Wang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3218

Weishung Liu

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

3219

Weiwei Chu

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

3220

Weize Kong

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

3221

Weizhe Yuan

Self-rewarding post-training

Co-authored Self-Rewarding Language Models: explores self-improvement via internal reward modeling.

Post-Training & Alignment Self-Rewarding Language Models

3222

Weizhu Chen

Parameter-efficient finetuning

A good person to know for the longer Microsoft line that runs from machine-comprehension systems into more recent adaptation work like LoRA and MTL-LoRA.

Systems & Infrastructure LoRA: Low-Rank Adaptation of Large Language Models

3223

Wen Liu

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.