Researchers — page 15

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1711

Krishna Haridasan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1712

Krishna Sri Ipsit Mantri

RWKV and efficient sequence modeling

A strong page to keep because it connects the original RWKV paper to a later, much clearer research identity in graph representation learning, latent-space geometry, and multi-task adaptation.

Open Models Systems & Infrastructure Krishna Sri Ipsit Mantri

1713

Krishnan Vaidyanathan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1714

Kristie Seymore

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1715

Kristina Toutanova

Bidirectional transformer pretraining (BERT)

Co-authored BERT: a turning point for transfer learning in NLP.

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

1716

Krithika Iyer

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1717

Krunoslav Zaher

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1718

Krzysztof Maziarz

Sparsely-gated mixture-of-experts

Co-authored the sparsely-gated MoE layer paper: a foundational conditional-computation design.

Systems & Infrastructure Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer

1719

Krzysztof Styrc

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1720

Kshitij Bansal

Open code models (CodeGemma)

Open Models Code Models CodeGemma: Open Code Models Based on Gemma

Co-authored CodeGemma: open code models based on Gemma.

1721

Kshitiz Malik

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1722

Kuai Yu

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1723

Kuanysh Omarov

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1724

Kuenley Chiu

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1725

Kun Huang

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Systems & Infrastructure GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

1726

Kun Zhang

Efficient MoE scaling (GLaM)

Co-authored GLaM: an influential MoE scaling reference in large language modeling.

1727

Kunal Bhalla

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1728

Kunal Chawla

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1729

Kushal Lakhotia

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1730

Kushal Majmundar

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1731

Kyla Sheppard

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

1732

Kyle He

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1733

Kyle Huang

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1734

Kyle Kosic

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

1735

Kyle Levin

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1736

Kyle Lo

Open, fully-documented language models (OLMo)

AI2

Co-authored OLMo: Accelerating the Science of Language Models.

Open Models OLMo: Accelerating the Science of Language Models

1737

Kyle McDonell

Open-source LLMs (EleutherAI)

Open Models Evaluation & Benchmarks Systems & Infrastructure A framework for few-shot language model evaluation

Worth tracking if you care about the seam between open-model benchmarking and the harder question of what frontier systems should actually be evaluated for.

1738

Kyle Richardson

Open, fully-documented language models (OLMo)

AI2

Co-authored OLMo: Accelerating the Science of Language Models.

Open Models OLMo: Accelerating the Science of Language Models

1739

Kyunghyun Cho

Self-rewarding post-training

Co-authored Self-Rewarding Language Models: explores self-improvement via internal reward modeling.

Post-Training & Alignment Self-Rewarding Language Models

1740

Lailin Chen

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1741

Lakshman Yagati

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1742

Lakshmi Ramachandruni

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1743

Lakshya Garg

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1744

Lala Li

Text-to-image diffusion with strong language understanding (Imagen)

Diffusion & Generative Media Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Co-authored Imagen: a milestone for photorealistic text-to-image diffusion models.

1745

Lam Nguyen Thiet

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1746

Lama Ahmad

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

Post-Training & Alignment Evaluation & Benchmarks Discovering Language Model Behaviors with Model-Written Evaluations

1747

Landon Goldberg

Model-written evaluations for LM behavior

Anthropic

Co-authored model-written evals: a practical technique for discovering and measuring LM behaviors.

1748

Lara Tumeh

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1749

Laria Reynolds

Open-source LLMs (EleutherAI)

Open Models Evaluation & Benchmarks Agents & Reasoning A framework for few-shot language model evaluation

A good person to follow for the part of evaluation work that goes beyond leaderboard scores and asks how models generalize across cultures, languages, and shifting social context.

1750

Larissa Rinaldi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1751

Lars Liden

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

1752

Lars Lowe Sjoesund

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

1753

Lars Lowe Sjösund

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1754

Laura Culp

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1755

Laura Gustafson

Promptable segmentation foundation models (SAM)

Vision & Robotics Segment Anything

Co-authored Segment Anything.

1756

Laura Knight

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1757

Laura Weidinger

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1758

Laurel Orr

Holistic evaluation of language models (HELM)

Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

Evaluation & Benchmarks Holistic Evaluation of Language Models

1759

Lauren Rantala-Yeary

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1760

Lauren Usui

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

1761

Lauren Workman

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

1762

Laurence Golding

Open-source LLMs (EleutherAI)

Open Models Evaluation & Benchmarks Systems & Infrastructure The Pile: An 800GB Dataset of Diverse Text for Language Modeling

One of the quieter but still important contributors in the open-data and open-evaluation lineage behind The Pile, GPT-NeoX, and later benchmarking infrastructure.

1763

Laurens van der Maaten

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1764

Laurent El Shafey

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1765

Laurent Sifre

Compute-optimal scaling for LLM training

Multimodal Reinforcement Learning Gemini: A Family of Highly Capable Multimodal Models

Important for the DeepMind large-model lineage because his work sits inside the sequence from compute-optimal scaling into Gemini rather than only the headline launch moment.

1766

Lavender A

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1767

Lawrence Chen

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1768

Le Hou

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1769

Lean Wang

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1770

Leandro Silva

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Open Models Code Models StarCoder: may the source be with you!

1771

Leandro von Werra

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

1772

Lecong Zhang

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1773

Lee Bell

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1774

Legg Yeung

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1775

Lei Wang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1776

Lei Xu

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

1777

Lei Zhang

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

1778

Leif Schelin

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1779

Lélio Renard Lavaud

Mixture-of-experts LLMs

Mistral

Co-authored Mixtral of Experts: a key MoE reference in the open-weights frontier.

Open Models Systems & Infrastructure Mixtral of Experts

1780

Lena Heuermann

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

1781

Lenny Bogdonoff

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

1782

Leo (Len) Gao

Open-source LLMs (EleutherAI)

Open Models Post-Training & Alignment Leo Gao

Worth tracking for the open-model side of the field, especially where dataset construction, practical training work, and alignment-flavored thinking meet.

1783

Léonard Hussenot

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

1784

Leslie Baker

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1785

Less Wright

Fully Sharded Data Parallel training (FSDP)

Co-authored PyTorch FSDP: practical lessons for scaling fully-sharded training workloads.

Systems & Infrastructure PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel

1786

Leticia Lago

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

1787

Lev Kurilenko

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

1788

Lev Proleev

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1789

Lexi Walker

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1790

Leyi Xia

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

1791

Li Lyna Zhang

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

1792

Liam Fedus

Pathways-scale language modeling (PaLM)

PaLM: Scaling Language Modeling with Pathways

Co-authored PaLM: Scaling Language Modeling with Pathways.

1793

Liana-Eleonora Marinescu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

1794

Liane Lovitt

Alignment via AI feedback (Constitutional AI)

Anthropic

A good page for the societal-impacts side of Anthropic’s research, especially where evaluation work turns toward persuasion and real-world downstream effects.

Post-Training & Alignment Evaluation & Benchmarks Reinforcement Learning Measuring the Persuasiveness of Language Models

1795

Liang Luo

Fully Sharded Data Parallel training (FSDP)

Co-authored PyTorch FSDP: practical lessons for scaling fully-sharded training workloads.

Systems & Infrastructure PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel

1796

Liang Tan

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.