Researchers — page 8

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

842

Dustin Holland

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

843

Dustin Li

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Evaluation & Benchmarks Agents & Reasoning Discovering Language Model Behaviors with Model-Written Evaluations

Worth tracking for the newer evaluation thread at Anthropic, especially where failure-mode discovery and faithfulness measurement extend beyond the original RLHF papers.

844

Dustin Schwenk

Open, fully-documented language models (OLMo)

AI2

Co-authored OLMo: Accelerating the Science of Language Models.

Open Models OLMo: Accelerating the Science of Language Models

845

Dustin Tran

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

846

Dustin Zelle

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

847

Dylan Banarse

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

848

Dylan Scandinaro

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

849

Dzmitry Bahdanau

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

Open Models Code Models StarCoder: may the source be with you!

850

Ed Chi

Chain-of-thought prompting and reasoning

Co-authored the chain-of-thought prompting paper; foundational for modern reasoning prompting.

Agents & Reasoning Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

851

Edden M Gerber

Hybrid Transformer–Mamba language models (Jamba)

Post-Training & Alignment Systems & Infrastructure JAMBA: Hybrid Transformer-Mamba Language Models

A worthwhile head-page upgrade because it gives one of the quieter Jamba contributors a concrete place in the stack: the pre- and post-training work that turns a hybrid architecture into an actual usable model.

852

Édouard Grave

Open-weight foundation models (LLaMA)

Open Models Systems & Infrastructure Bag of Tricks for Efficient Text Classification

Important for the practical representation-learning line behind fastText, multilingual embeddings, and later open-weight model work at Meta.

853

Edouard Leurent

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

854

Edouard Yvinec

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

Systems & Infrastructure Agents & Reasoning Reinforcement Learning Reflexion: Language Agents with Verbal Reinforcement Learning

855

Edward Berman

Self-reflection loops for LLM agents (Reflexion)

Co-authored Reflexion: a practical pattern for improving agents via self-critique and memory.

856

Edward Dowling

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Systems & Infrastructure About | Edward Hu

857

Edward J. Hu

Parameter-efficient finetuning

A high-signal person to study if you care about the practical mechanics of adapting large models, especially where scaling theory turns into techniques that actually spread across the industry.

858

Edward Li

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

859

Edward Lockhart

Planning with learned dynamics (MuZero)

Reinforcement Learning Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

Co-authored MuZero: planning with a learned model across games and Atari.

860

Edward Loper

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

861

Edward Yang

Deep learning infrastructure (PyTorch)

Co-authored the PyTorch paper describing the imperative-style deep learning framework.

Open Models Systems & Infrastructure PyTorch: An Imperative Style, High-Performance Deep Learning Library

862

Edwin Chen

Model-written evaluations for LM behavior

Post-Training & Alignment Evaluation & Benchmarks Discovering Language Model Behaviors with Model-Written Evaluations

Co-authored model-written evals: a practical technique for discovering and measuring LM behaviors.

863

Egor Filonov

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

864

Egor Lakomkin

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

865

Ehab AlBadawy

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

866

Ehsan Amid

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

867

Eileen O'Neill

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

868

Eissa Jamil

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

869

Elad Dolev

Hybrid Transformer–Mamba language models (Jamba)

Systems & Infrastructure Elad Dolev

One of the clearer infrastructure pages in the AI21 cluster because it anchors the operational side of the stack: deployment, reliability, and the systems work needed to keep fast-moving model releases usable.

870

Elahe Rahimtoroghi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

871

Elaine Montgomery

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

872

Elena Allica Abellan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

873

Elena Buchatskaya

Compute-optimal scaling for LLM training

Open Models Multimodal Gemini: A Family of Highly Capable Multimodal Models

Worth tracking for the DeepMind thread that links large-model scaling research to the multimodal Gemini stack, rather than treating those as separate eras.

874

Elena Gribovskaya

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

875

Eleonora Presani

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

876

Eli Collins

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

877

Eli Tran-Johnson

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Evaluation & Benchmarks Reinforcement Learning Discovering Language Model Behaviors with Model-Written Evaluations

A useful profile for the people building Anthropic’s evaluation stack, especially the model-written-evals line that tries to surface behaviors faster than hand-built test sets can.

878

Elico Teixeira

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

879

Elie Georges

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

880

Elina Lobanova

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

881

Elisa Bandy

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

882

Eliza Rutherford

Compute-optimal scaling for LLM training

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Worth tracking for the contributor layer inside DeepMind’s language-model program rather than only the most visible public faces of Gemini and Chinchilla.

883

Elizabeth Barnes

Code-focused LLMs and evaluation (Codex)

Evaluation & Benchmarks Code Models Evaluating Large Language Models Trained on Code

Co-authored the Codex evaluation paper: an early anchor for code LLM capability measurement.

884

Elizabeth Cole

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

885

Elizabeth Proehl

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

886

Elizabeth Tseng

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

887

Elliot Catt

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

888

Elnaz Davoodi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

889

Elspeth White

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

890

Elton Zheng

Large-scale transformer inference (DeepSpeed)

Co-authored DeepSpeed Inference: practical inference optimizations for serving large transformer models.

Systems & Infrastructure DeepSpeed Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale

891

Emanuel Taropa

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

892

Emilio Parisotto

Generalist agents (Gato)

Multimodal Agents & Reasoning A Generalist Agent

Co-authored Gato: a key reference for generalist, multi-task agents.

893

Emily Caveness

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

894

Emily Denton

Text-to-image diffusion with strong language understanding (Imagen)

Diffusion & Generative Media Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Co-authored Imagen: a milestone for photorealistic text-to-image diffusion models.

895

Emily Dinan

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

896

Emily Hahn

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

897

Emily Pitler

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

898

Emily Reif

Pathways-scale language modeling (PaLM)

PaLM: Scaling Language Modeling with Pathways

Co-authored PaLM: Scaling Language Modeling with Pathways.

899

Emily Wood

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

900

Emily Xue

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

901

Emma Bou Hanna

Mixture-of-experts LLMs

Mistral

Co-authored Mixtral of Experts: a key MoE reference in the open-weights frontier.

Open Models Systems & Infrastructure Mixtral of Experts

902

Emma Strubell

Open, fully-documented language models (OLMo)

AI2

Co-authored OLMo: Accelerating the Science of Language Models.

Open Models OLMo: Accelerating the Science of Language Models

903

Emma Wang

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

904

Emman Haider

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

905

Emmanouil Koukoumidis

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

906

Emy Parparita

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

907

Enrique Piqueras

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

908

Eran Globen

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

909

Eran Krakovsky

Hybrid Transformer–Mamba language models (Jamba)

Systems & Infrastructure Eran Krakovsky

A solid page for the engineering side of model development because it captures the people who turn hybrid-architecture research into actual trained and shipped systems rather than just writing the abstract.

910

Eran Ofek

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

911

Erdem Guven

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

912

Eren Sezener

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

913

Erez Safahi

Hybrid Transformer–Mamba language models (Jamba)

Systems & Infrastructure Jamba: A Hybrid Transformer-Mamba Language Model

Useful because it turns one of the anonymous-looking Jamba authors into an actual person page, which makes the hybrid-model line easier to understand than treating it as a single monolithic team output.

914

Erez Schwartz

Hybrid Transformer–Mamba language models (Jamba)

Systems & Infrastructure Erez Schwartz

A useful page for the implementation layer of AI21 research because it captures the engineers who turn the company's hybrid-model ideas into trained systems and concrete releases.

915

Erhang Li

Open-model frontier reports (DeepSeek-V3)

DeepSeek

Co-authored the DeepSeek-V3 Technical Report.

Open Models DeepSeek-V3 Technical Report

916

Eri Latorre-Chimoto

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

917

Eric Alcaide

RWKV and efficient sequence modeling

A distinctive page because his work bridges open-sequence-model experimentation with applied machine learning for molecules, proteins, and structural biology, and he shows up on multiple RWKV-family papers including the hybrid GoldFinch branch rather than only the first release.

Open Models Systems & Infrastructure Eric Alcaide

918

Eric Chu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

919

Eric Hallahan

Open-source LLMs (EleutherAI)

EleutherAI

Useful because his footprint runs through the early EleutherAI training stack, GPT-NeoX, and Pythia, which makes the page a better map of open-model infrastructure than a generic one-paper profile.

Open Models Systems & Infrastructure About Eric Hallahan

920

Eric Hambro

Open-weight foundation models (LLaMA)

Open Models Systems & Infrastructure Reinforcement Learning Dungeons and Data: A Large-Scale NetHack Dataset

Interesting because his work spans two fairly different but important threads: open-ended reinforcement-learning environments and the later open-weight model push around LLaMA.

921

Eric Johnston

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

922

Eric Malmi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

923

Eric Michael Smith

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

924

Eric Mintun

Promptable segmentation foundation models (SAM)

Vision & Robotics Segment Anything

Co-authored Segment Anything.

925

Eric Mitchell

Direct preference optimization (DPO)

A high-signal name for the current alignment toolkit, especially if you want to understand how preference optimization connects back to broader language-model adaptation work.

Post-Training & Alignment Direct Preference Optimization: Your Language Model is Secretly a Reward Model

926

Eric Ni

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

927

Eric Noland

Compute-optimal scaling for LLM training

Multimodal Systems & Infrastructure Gemini: A Family of Highly Capable Multimodal Models

A useful profile for the quieter contributor layer behind DeepMind’s frontier language-model systems, especially across Chinchilla and Gemini.

928

Eric P. Xing

LLM-as-a-judge evaluation (MT-Bench)

Co-authored MT-Bench / LLM-as-a-judge: a widely used template for scalable multi-turn evaluation.

Evaluation & Benchmarks Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena

929

Eric Price

Small-model reasoning and capability (Phi-4)

Co-authored the Phi-4 Technical Report.

Open Models Agents & Reasoning Phi-4 Technical Report

930

Eric Sigler

Large-scale language modeling (GPT-3)

Language Models are Few-Shot Learners (GPT-3)

Co-authored GPT-3: Language Models are Few-Shot Learners.

931

Eric Wallace

Training-data extraction and privacy risks

Co-authored Extracting Training Data from Large Language Models: a core paper on memorization and extraction risk.

Security & Robustness Extracting Training Data from Large Language Models

932

Eric Zelikman

Holistic evaluation of language models (HELM)

Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

Evaluation & Benchmarks Holistic Evaluation of Language Models

933

Eric Zhu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

934

Eric-Tuan Le

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

935

Erica Moreira

Pathways-scale language modeling (PaLM)

PaLM: Scaling Language Modeling with Pathways

Co-authored PaLM: Scaling Language Modeling with Pathways.

936

Erich Elsen

Compute-optimal scaling for LLM training

Multimodal Gemini: A Family of Highly Capable Multimodal Models

A useful profile for the DeepMind scaling stack that fed directly into Gemini, especially across the Chinchilla and Gopher phases.

937

Erik Brinkman

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

938

Erwin Huizenga

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

Evaluation & Benchmarks Holistic Evaluation of Language Models

939

Esin Durmus

Holistic evaluation of language models (HELM)

Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

940

Esteban Arcaute

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

941

Ethan Dyer

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

942

Ethan Perez

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Evaluation & Benchmarks Reinforcement Learning Constitutional AI: Harmlessness from AI Feedback

Important because he sits near the boundary between alignment theory and concrete failure-mode discovery, especially jailbreaks, preference training, and behavior evaluations.

943

Etienne Pot

Open multimodal models (Gemma 3)

Co-authored the Gemma 3 Technical Report.

944

Eugene Kharitonov

Open multimodal models (Gemma 3)