Researchers — page 21

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

2413

Phoebe Thacker

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2414

Phuong Dao

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2415

Pidong Wang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2416

Pier Giuseppe Sessa

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

2417

Piermaria Mendolicchio

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2418

Piero Kauffmann

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

2419

Pierre Roux

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Open Models Systems & Infrastructure Mixtral of Experts

2420

Pierre Stock

Mixture-of-experts LLMs

Mistral

Co-authored Mixtral of Experts: a key MoE reference in the open-weights frontier.

2421

Pierre-Louis Cedoz

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2422

Pierric Cistac

Open-source tooling for modern NLP (Transformers library)

Hugging Face

Co-authored the Hugging Face Transformers paper that helped standardize modern NLP workflows.

Open Models Transformers: State-of-the-Art Natural Language Processing

2423

Pieter Abbeel

Denoising diffusion probabilistic models

Co-authored DDPM: the modern diffusion-model starting point.

Vision & Robotics Diffusion & Generative Media Denoising Diffusion Probabilistic Models

2424

Pingmei Xu

Open multimodal models (Gemma 3)

Open Models Multimodal Gemma 3 Technical Report

Co-authored the Gemma 3 Technical Report.

2425

Piotr Bojanowski

Self-supervised vision transformers (DINO)

Co-authored DINO: influential self-supervised representation learning for vision transformers.

Vision & Robotics Emerging Properties in Self-Supervised Vision Transformers

2426

Piotr Dollár

Masked autoencoders for vision (MAE)

Co-authored MAE: a strong template for scalable self-supervised vision pretraining.

Vision & Robotics Masked Autoencoders Are Scalable Vision Learners

2427

Piotr Padlewski

Scaled multilingual vision-language models (PaLI)

Multimodal PaLI: A Jointly-Scaled Multilingual Language-Image Model

Co-authored PaLI: a key reference for scaling multilingual vision-language models.

2428

Piotr Stanczyk

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

2429

Piyush Madan

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

2430

Piyush Patil

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2431

Polina Zablotskaia

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2432

Polina Zvyagina

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2433

Pouya Tafti

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

2434

Pradeep Dasigi

Open, fully-documented language models (OLMo)

AI2

Co-authored OLMo: Accelerating the Science of Language Models.

Open Models OLMo: Accelerating the Science of Language Models

2435

Pradeep Kuppala

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

2436

Pradyumna Narayana

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2437

Prafulla Dhariwal

Diffusion-based text-to-image generation (DALL·E 2)

Diffusion & Generative Media Hierarchical Text-Conditional Image Generation with CLIP Latents

Co-authored DALL·E 2: hierarchical text-conditional image generation with CLIP latents.

2438

Prajjwal Bhargava

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

2439

Prakash Shroff

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2440

Pranab Saxena

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2441

Pranav Shyam

Large-scale language modeling (GPT-3)

Language Models are Few-Shot Learners (GPT-3)

Co-authored GPT-3: Language Models are Few-Shot Learners.

2442

Praneetha Vaddamanu

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

2443

Praseem Banzal

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2444

Prashant Ratanchandani

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2445

Prateek Kolhar

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2446

Pratik Dubal

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2447

Pratik Joshi

Open code models (CodeGemma)

Open Models Code Models CodeGemma: Open Code Models Based on Gemma

Co-authored CodeGemma: open code models based on Gemma.

2448

Pratul P. Srinivasan

Neural radiance fields (NeRF)

Co-authored NeRF: a foundational paper for neural rendering and 3D scene representations.

Vision & Robotics NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

2449

Praveen Krishnan

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2450

Praveen Srinivasan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2451

Preethi Lahoti

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2452

Premal Shah

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2453

Preston Tuggle

Frontier model development (GPT-4)

Systems & Infrastructure PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

GPT-4 Technical Report

2454

Pritam Damania

Fully Sharded Data Parallel training (FSDP)

Co-authored PyTorch FSDP: practical lessons for scaling fully-sharded training workloads.

2455

Pritish Yuvraj

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2456

Priya Jhakra

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2457

Priya Ponnapalli

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2458

Priyanka Agrawal

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2459

Przemysław Kazienko

RWKV and efficient sequence modeling

Important in the long tail because he is another contributor whose work spans both the RWKV sequence-model thread and the Polish PLLuM effort, which makes his page more informative than a generic single-paper profile.

Open Models Systems & Infrastructure RWKV: Reinventing RNNs for the Transformer Era

2460

Pulkit Mehta

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2461

Punit Singh Koura

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

2462

Purvi Shah

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2463

Pushkar Mishra

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

2464

Pushmeet Kohli

Robotics, vision, structured prediction

Evaluation & Benchmarks Vision & Robotics Security & Robustness Accurate proteome-wide missense variant effect prediction with AlphaMissense

A strong person to follow if you want to understand how frontier AI gets pushed into science, security, and trustworthy deployment rather than staying inside benchmark culture.

2465

Puxin Xu

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

2466

Qi Li

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2467

Qian Huang

Holistic evaluation of language models (HELM)

Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

Evaluation & Benchmarks Holistic Evaluation of Language Models

2468

Qian Liang

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Open Models Code Models StarCoder: may the source be with you!

2469

Qian Liu

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

2470

Qian Yang

Audio-capable open models (Qwen2-Audio)

Qwen

Co-authored the Qwen2-Audio Technical Report.

Open Models Multimodal Qwen2-Audio Technical Report

2471

Qiancheng Wang

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

2472

Qiao Zhang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2473

Qihang Zhao

RWKV and efficient sequence modeling

Useful because his work connects the main RWKV sequence-model line with the RWKV-inspired SpikeGPT branch, making the page more informative than a single coauthor record.

Open Models Systems & Infrastructure Diffusion & Generative Media RWKV: Reinventing RNNs for the Transformer Era

2474

Qihao Zhu

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

2475

Qijun Tan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2476

Qiming Yuan

Code-focused LLMs and evaluation (Codex)

Evaluation & Benchmarks Code Models Evaluating Large Language Models Trained on Code

Co-authored the Codex evaluation paper: an early anchor for code LLM capability measurement.

2477

Qin Cai

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

2478

Qing He

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Open Models Systems & Infrastructure Qinghua Zhou at King’s College London

2479

Qinghua Zhou

RWKV and efficient sequence modeling

Worth keeping because it turns an otherwise ghostlike RWKV byline into a real researcher page with a visible current program in trustworthy AI, neural-network theory, and high-dimensional learning.

2480

Qingxiao Dong

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Open Models Multimodal Post-Training & Alignment Visual Instruction Tuning

2481

Qingyang Wu

Visual instruction tuning (LLaVA)

Co-authored Visual Instruction Tuning: a widely-cited recipe for LLaVA-style multimodal assistants.

2482

Qingze Wang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2483

Qinyu Chen

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

2484

Qiushi Du

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

2485

Quan Yuan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2486

Quentin Anthony

Open-source LLMs (EleutherAI)

EleutherAI

A strong person to follow for the systems side of open models, especially where distributed training, hybrid architectures, and practical efficiency work feed directly into model capability.

Open Models Systems & Infrastructure Quentin Anthony

2487

Quentin Lhoest

Open-source tooling for modern NLP (Transformers library)

Hugging Face

Co-authored the Hugging Face Transformers paper that helped standardize modern NLP workflows.

Open Models Transformers: State-of-the-Art Natural Language Processing

2488

Quoc Le

Chain-of-thought prompting and reasoning

Co-authored the chain-of-thought prompting paper; foundational for modern reasoning prompting.

Agents & Reasoning Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

2489

Quoc V. Le

Gemini (multimodal foundation models)

One of the central Google researchers to follow for the line from large-scale language modeling into instruction tuning, multilingual systems, and practical model scaling.

Multimodal Post-Training & Alignment Systems & Infrastructure Quoc V. Le

2490

R. J. Chen

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

2491

R. L. Jin

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

2492

Rachad Alao

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2493

Rachel Lim

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

GPT-4 Technical Report

2494

Rachel Rodriguez

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2495

Rachel Saputro

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2496

Rachel Sterneck

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2497

Rachel Ward

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

2498

Radu Soricut

Gemini (multimodal foundation models)

Important for understanding how multilingual NLP, translation, and multimodal reasoning meet inside production-scale frontier systems rather than staying separate research tracks.

Multimodal Systems & Infrastructure Agents & Reasoning Radu Soricut

2499

Rafael Rafailov

Direct preference optimization (DPO)

One of the most important newer names to track in alignment-flavored language-model work because he sits directly on the line from DPO into newer attempts to turn language models into better optimizers and agents.

Post-Training & Alignment Agents & Reasoning Direct Preference Optimization: Your Language Model is Secretly a Reward Model

2500

Rafi Ayub

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2501

Ragavan Srinivasan

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2502

Ragha Kotikalapudi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2503

Raghavender R

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2504

Raghotham Murthy

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2505

Raghu Nayani

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2506

Rahma Chaabouni

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

2507

Rahul Goel

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2508

Rahul Mitra

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2509

Rahul Rishi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2510

Raia Hadsell

Generalist agents (Gato)

Multimodal Agents & Reasoning A Generalist Agent

Co-authored Gato: a key reference for generalist, multi-task agents.

2511

Raj Ganapathy

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2512

Rajeev Nayak

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

GPT-4 Technical Report

2513

Rajkumar Samuel

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2514

Rakesh Ghiya

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2515

Rakesh Shivanna

Open multimodal models (Gemma 3)

Open Models Multimodal Gemma 3 Technical Report

Co-authored the Gemma 3 Technical Report.

2516

Rama Pasumarthi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2517

Ramon Calderer

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2518

Ramona Comanescu

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

2519

Ramona Merhej

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

2520

Rangaprabhu Parthasarathy

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.