Researchers — page 14

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1570

Julie Fadlon

Hybrid Transformer–Mamba language models (Jamba)

AI21

An especially valuable page for understanding how AI systems get judged in practice, because it puts human evaluation and rubric design at the center rather than treating them as an afterthought to model building.

Evaluation & Benchmarks Systems & Infrastructure Agents & Reasoning Julie Fadlon

1571

Julien Chaumond

Open-source tooling for modern NLP (Transformers library)

Hugging Face

Co-authored the Hugging Face Transformers paper that helped standardize modern NLP workflows.

Open Models Transformers: State-of-the-Art Natural Language Processing

1572

Julien Mairal

Self-supervised vision transformers (DINO)

Co-authored DINO: influential self-supervised representation learning for vision transformers.

Vision & Robotics Emerging Properties in Self-Supervised Vision Transformers

1573

Julien Plu

Open-source tooling for modern NLP (Transformers library)

Hugging Face

Co-authored the Hugging Face Transformers paper that helped standardize modern NLP workflows.

Open Models Transformers: State-of-the-Art Natural Language Processing

1574

Juliette Love

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

1575

Jun Xu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1576

Junheng Hao

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

1577

Junhyuk Oh

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1578

Junjie Qiu

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1579

Junjie Wang

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1580

Junlong Li

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1581

Junnan Li

Bootstrapped vision-language pretraining (BLIP)

Co-authored BLIP: a high-impact recipe for unified vision-language understanding and generation.

Multimodal BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

1582

Juntang Zhuang

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

1583

Junteng Jia

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1584

Junwhan Ahn

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1585

Junxiao Song

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1586

Junyang Lin

Open-weight LLMs (Qwen)

Open Models Qwen Technical Report

Co-authored the Qwen Technical Report.

1587

Justin Chiu

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

1588

Justin Chung

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1589

Justin Frye

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1590

Justin Gilmer

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1591

Justin Jay Wang

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

1592

Justin Mao-Jones

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

1593

Juston Forte

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

1594

Jyoti Aneja

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

1595

Jyotinder Singh

Open multimodal models (Gemma 3)

Open Models Multimodal Gemma 3 Technical Report

Co-authored the Gemma 3 Technical Report.

1596

Kai Dang

Open-weight LLMs (Qwen)

Open Models Qwen Technical Report

Co-authored the Qwen Technical Report.

1597

Kai Dong

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1598

Kai Hu

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1599

Kai Kang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1600

Kai Wu

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1601

Kai Xiao

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

1602

Kai Yang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1603

Kai Zhao

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1604

Kaige Gao

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1605

Kaiming He

Computer vision, representation learning

Systems & Infrastructure Vision & Robotics Kaiming He

A foundational computer-vision researcher whose work on representations and architectures still shapes modern pretraining and perception systems.

1606

Kaisheng Yao

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1607

Kalind Thakkar

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1608

Kalpesh Krishna

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1609

Kalyan Saladi

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

1610

Kalyan Vasuden Alwala

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1611

Kam Hou U

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1612

Kamal Ndousse

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Evaluation & Benchmarks Reinforcement Learning Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Worth following for the evaluation side of Anthropic’s alignment program, especially where model-written tests and public-input methods become practical tooling rather than just ideas.

1613

Kamilė Lukošiūtė

Model-written evaluations for LM behavior

Post-Training & Alignment Evaluation & Benchmarks Discovering Language Model Behaviors with Model-Written Evaluations

Co-authored model-written evals: a practical technique for discovering and measuring LM behaviors.

1614

Kamile Lukosuite

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Evaluation & Benchmarks Agents & Reasoning Discovering Language Model Behaviors with Model-Written Evaluations

Worth following for the evaluation side of alignment work, especially where model-written tests and more faithful reasoning traces are used to make model behavior easier to inspect.

1615

Kane Jang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1616

Kang Guan

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1617

Karan Saxena

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1618

Kareem Ayoub

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1619

Kareem Mohamed

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

1620

Karel Lenc

Few-shot vision-language models (Flamingo)

Multimodal Flamingo: a Visual Language Model for Few-Shot Learning

Co-authored Flamingo: an influential multimodal model for few-shot vision-language tasks.

1621

Karen Simonyan

Compute-optimal scaling for LLM training

Multimodal Reinforcement Learning Very Deep Convolutional Networks for Large-Scale Image Recognition

A foundational vision researcher who also matters for the more recent DeepMind language-model lineage, making him a good bridge between classic deep-learning milestones and the Gemini era.

1622

Karina Nguyen

Model-written evaluations for LM behavior

Post-Training & Alignment Evaluation & Benchmarks Discovering Language Model Behaviors with Model-Written Evaluations

Co-authored model-written evals: a practical technique for discovering and measuring LM behaviors.

1623

Karl Cobbe

Grade-school math reasoning (GSM8K)

Co-authored GSM8K: a core benchmark/dataset for math word problems and verification.

Evaluation & Benchmarks Agents & Reasoning Training Verifiers to Solve Math Word Problems (GSM8K)

1624

Karthik Kappaganthu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1625

Karthik Narasimhan

Reasoning + acting for LLM agents (ReAct)

Co-authored ReAct: a simple, high-leverage template for tool-using LLM agents.

Agents & Reasoning ReAct: Synergizing Reasoning and Acting in Language Models

1626

Karthik Prasad

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1627

Kartikay Khandelwal

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1628

Kartikeya Badola

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

1629

Kartikeya Upasani

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1630

Kat Black

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

1631

Katarina Slama

Instruction-following via RLHF (InstructGPT)

Post-Training & Alignment Training Language Models to Follow Instructions with Human Feedback

A useful page for the OpenAI preference-learning line, especially if you want to understand how the field moved from InstructGPT-era RLHF into later work on whether stated preferences actually predict model behavior.

1632

Katayoun Zand

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1633

Kate Baumli

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1634

Kate Olszewska

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1635

Kate Plawiak

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1636

Katerina Tsihlas

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1637

Katherine Crowson

Open large-scale image-text data (LAION-5B)

Co-authored LAION-5B: a widely used open dataset for vision-language foundation models.

Multimodal LAION-5B: An open large-scale dataset for training next generation image-text models

1638

Katherine Lee

Pathways-scale language modeling (PaLM)

PaLM: Scaling Language Modeling with Pathways

Co-authored PaLM: Scaling Language Modeling with Pathways.

1639

Kathie Wang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1640

Kathleen Kenealy

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

1641

Kathleen Meier-Hellstern

Efficient MoE scaling (GLaM)

Co-authored GLaM: an influential MoE scaling reference in large language modeling.

Systems & Infrastructure GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

1642

Kathy Korevec

Open code models (CodeGemma)

Open Models Code Models CodeGemma: Open Code Models Based on Gemma

Co-authored CodeGemma: open code models based on Gemma.

1643

Kathy Matosich

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1644

Kathy Meier-Hellstern

Pathways-scale language modeling (PaLM)

PaLM: Scaling Language Modeling with Pathways

Co-authored PaLM: Scaling Language Modeling with Pathways.

1645

Kathy Wu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1646

Kathy Yu

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

1647

Katie Mayer

Code-focused LLMs and evaluation (Codex)

Evaluation & Benchmarks Code Models Evaluating Large Language Models Trained on Code

Co-authored the Codex evaluation paper: an early anchor for code LLM capability measurement.

1648

Katie Millican

Gemini (multimodal foundation models)

Worth tracking for the data side of multimodal frontier models, where the quality and shape of training mixtures strongly determine what large systems can actually do.

Multimodal Systems & Infrastructure Gemini: A Family of Highly Capable Multimodal Models

1649

Kaushik Shivakumar

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1650

Kaushik Veeraraghavan

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1651

Ke Li

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1652

Ke Ye

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1653

Kedar Soparkar

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1654

Keelin McDonell

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

1655

Kefan Xiao

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1656

Kehang Han

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1657

Keith Pallo

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1658

Kellie Webster

Efficient MoE scaling (GLaM)

Co-authored GLaM: an influential MoE scaling reference in large language modeling.

Systems & Infrastructure GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

1659

Kelly Michelena

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1660

Kelly Schaefer

Open code models (CodeGemma)

Open Models Code Models CodeGemma: Open Code Models Based on Gemma

Co-authored CodeGemma: open code models based on Gemma.

1661

Kelvin Guu

Instruction tuning for better zero-shot behavior

Co-authored FLAN: a practical anchor for instruction tuning and zero-shot transfer.

Post-Training & Alignment Finetuned Language Models Are Zero-Shot Learners

1662

Kelvin Nguyen

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

1663

Kelvin Xu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1664

Keming Lu

Open-weight LLMs (Qwen)

Open Models Qwen Technical Report

Co-authored the Qwen Technical Report.

1665

Ken Durden

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1666

Ken Franko

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1667

Kendra Rimbach

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

1668

Kenneth Heafield

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1669

Kenny Hsu

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

1670

Kensen Shi

Pathways-scale language modeling (PaLM)

PaLM: Scaling Language Modeling with Pathways

Co-authored PaLM: Scaling Language Modeling with Pathways.

1671

Kenton Lee

NLP systems and evaluation

A strong person to follow for practical language systems because his work sits right at the intersection of pretraining, retrieval, and question answering, where product-grade NLP systems either become robust or fall apart.

Evaluation & Benchmarks Systems & Infrastructure Security & Robustness Kenton Lee

1672

Keqian Li

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1673

Keqin Chen

Open-weight LLMs (Qwen2)