Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open-weight foundation models (LLaMA)
A useful page for the code-model branch of Meta’s open-weight work, tracing how the broader LLaMA effort was specialized into stronger code-focused systems.
Holistic evaluation of language models (HELM)
Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Small, capable models (Phi-3)
Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open-model frontier reports (DeepSeek-V3)
Co-authored the DeepSeek-V3 Technical Report.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Self-reflection loops for LLM agents (Reflexion)
Co-authored Reflexion: a practical pattern for improving agents via self-critique and memory.
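The Reflexion blurb above describes a self-critique-plus-memory loop; a minimal sketch of that pattern, with hypothetical `act`, `evaluate`, and `reflect` callables standing in for LLM calls:

```python
# Sketch of a Reflexion-style loop. `act`, `evaluate`, and `reflect` are
# hypothetical stand-ins for model calls, not the paper's actual interface.

def reflexion_loop(task, act, evaluate, reflect, max_trials=3):
    """Retry a task, feeding verbal self-critiques back in as memory."""
    memory = []  # reflections accumulated across trials
    attempt = None
    for _ in range(max_trials):
        attempt = act(task, memory)      # act, conditioned on past reflections
        ok, feedback = evaluate(attempt)  # external or self-evaluation signal
        if ok:
            return attempt
        memory.append(reflect(attempt, feedback))  # store a self-critique
    return attempt
```

The key design point the paper makes is that the memory is verbal (natural-language critiques), not gradient updates.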
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open code LLMs (StarCoder)
Co-authored StarCoder: a foundational open code model effort (BigCode).
Open-weight LLMs (Qwen)
Co-authored the Qwen Technical Report.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Chain-of-thought prompting and reasoning
Co-authored the chain-of-thought prompting paper; foundational for modern reasoning prompting.
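The core idea of the chain-of-thought paper is prompting with worked reasoning before the final answer; a minimal sketch (the exemplar text is illustrative, adapted from the paper's well-known tennis-ball example):

```python
# Few-shot chain-of-thought prompt construction: show a worked solution,
# then ask the new question, so the model imitates step-by-step reasoning.

EXEMPLAR = (
    "Q: Roger has 5 tennis balls. He buys 2 cans of 3 balls each. "
    "How many balls does he have now?\n"
    "A: Roger started with 5 balls. 2 cans of 3 balls is 6 balls. "
    "5 + 6 = 11. The answer is 11.\n"
)

def cot_prompt(question: str) -> str:
    """Prepend a worked exemplar, then leave the answer open for the model."""
    return f"{EXEMPLAR}\nQ: {question}\nA:"
```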
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Code-focused LLMs and evaluation (Codex)
Co-authored the Codex evaluation paper: an early anchor for code LLM capability measurement.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
RWKV and efficient sequence modeling
A useful RWKV page: his work extends beyond the original paper into multimodal and longer-context experiments, showing how the RWKV line kept evolving.
Open language models from Google (Gemma)
Co-authored Gemma: open models based on Gemini research and technology.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Frontier model development (GPT-4)
Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Frontier model development (GPT-4)
Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.
Mixture-of-experts LLMs
Co-authored Mixtral of Experts: a key MoE reference in the open-weights frontier.
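The Mixtral entry above names sparse mixture-of-experts routing; a toy sketch of the top-k gating idea (expert and gate names here are illustrative, not the paper's code):

```python
import math

def top_k_route(logits, k=2):
    """Pick the top-k experts per token and renormalize their gate
    weights with a softmax over just the selected logits."""
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    exps = [math.exp(logits[i]) for i in top]
    z = sum(exps)
    return [(i, e / z) for i, e in zip(top, exps)]

def moe_forward(x, experts, gate_logits, k=2):
    """Run only the selected experts and combine their outputs by gate weight."""
    return sum(w * experts[i](x) for i, w in top_k_route(gate_logits, k))
```

The point of the sparsity is that only k of the experts run per token, so capacity grows faster than per-token compute.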
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Training-data extraction and privacy risks
Co-authored Extracting Training Data from Large Language Models: a core paper on memorization and extraction risk.
Code-focused LLMs and evaluation (Codex)
Co-authored the Codex evaluation paper: an early anchor for code LLM capability measurement.
Frontier model development (GPT-4)
Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open language models (Gemma 2)
Co-authored Gemma 2: improving open language models at a practical size.
Frontier model development (GPT-4)
Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.
Red teaming with language models
Co-authored Red Teaming LMs with LMs: a concrete approach to stress-testing model behavior at scale.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Transformer-based object detection (DETR)
Co-authored DETR: simplified object detection via end-to-end transformer training.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Instruction-following via RLHF (InstructGPT)
Co-authored the InstructGPT paper that set the standard instruction-tuning + RLHF recipe.
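One concrete piece of the InstructGPT recipe is the pairwise reward-model objective: score the human-preferred response above the rejected one. A minimal sketch of that Bradley-Terry-style loss (function name is illustrative):

```python
import math

def preference_loss(r_chosen: float, r_rejected: float) -> float:
    """Pairwise reward-model loss: -log sigmoid(r_chosen - r_rejected).
    Small when the chosen response is scored well above the rejected one."""
    return -math.log(1.0 / (1.0 + math.exp(-(r_chosen - r_rejected))))
```

The trained reward model then supplies the scalar signal for the RL (PPO) stage of the recipe.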
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open multimodal models (Gemma 3)
Co-authored the Gemma 3 Technical Report.
Holistic evaluation of language models (HELM)
Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.
Open-model frontier reports (DeepSeek-V3)
Co-authored the DeepSeek-V3 Technical Report.
Open-model frontier reports (DeepSeek-V3)
Co-authored the DeepSeek-V3 Technical Report.
Generalist agents (Gato)
Co-authored Gato: a key reference for generalist, multi-task agents.
Frontier model development (GPT-4)
Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.
Vision-language pretraining (CLIP)
Co-authored CLIP: a core reference for contrastive multimodal pretraining.
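CLIP's contrastive objective is a symmetric cross-entropy over an image-text similarity matrix whose matched pairs sit on the diagonal; a minimal pure-Python sketch of that loss:

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def clip_loss(sim):
    """Symmetric contrastive loss over an NxN image-text similarity
    matrix; entry sim[i][j] scores image i against text j."""
    n = len(sim)
    img2txt = sum(-math.log(softmax(sim[i])[i]) for i in range(n)) / n
    txt2img = sum(-math.log(softmax([sim[i][j] for i in range(n)])[j])
                  for j in range(n)) / n
    return (img2txt + txt2img) / 2
```

A sharply diagonal similarity matrix (matched pairs most similar) drives the loss toward zero; a flat matrix leaves it at log N.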
Open language models (Gemma 2)
Co-authored Gemma 2: improving open language models at a practical size.
Transformer-based object detection (DETR)
Co-authored DETR: simplified object detection via end-to-end transformer training.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Open multimodal models (Gemma 3)
Co-authored the Gemma 3 Technical Report.
Open multimodal models (Gemma 3)
Co-authored the Gemma 3 Technical Report.
Hybrid Transformer–Mamba language models (Jamba)
Worth surfacing because he sits inside the original Jamba author group, which makes the AI21 hybrid-model story legible at the contributor level rather than only at the company level.
Hybrid Transformer–Mamba language models (Jamba)
Worth knowing because his work links earlier dense-retrieval research to later MRKL and Jamba systems, making his page a bridge between classic NLP retrieval and newer hybrid LLM stacks.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open-weight LLMs (Qwen)
Co-authored the Qwen Technical Report.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open language models (Gemma 2)
Co-authored Gemma 2: improving open language models at a practical size.
Pathways-scale language modeling (PaLM)
Co-authored PaLM: Scaling Language Modeling with Pathways.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open-weight foundation models (LLaMA)
A stronger page than the old stub: his work spans two important threads in modern language models, early retrieval-augmented generation systems such as Atlas and the later LLaMA open-weight model line.
Fully Sharded Data Parallel training (FSDP)
Co-authored PyTorch FSDP: practical lessons for scaling fully-sharded training workloads.
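The FSDP idea above (each rank stores only a shard of the parameters and all-gathers the full set just in time) can be sketched in plain Python; this toy version over a flat parameter list is an illustration of the memory-saving scheme, not the PyTorch API:

```python
# Toy fully-sharded data parallelism: shard parameters across ranks,
# reassemble on demand. Function names are illustrative.

def shard(params, world_size):
    """Split a flat parameter list into one shard per rank
    (the last rank's shard may be shorter)."""
    per_rank = -(-len(params) // world_size)  # ceiling division
    return [params[r * per_rank:(r + 1) * per_rank] for r in range(world_size)]

def all_gather(shards):
    """Reassemble the full parameter list from every rank's shard,
    as FSDP does just before a layer's forward/backward pass."""
    return [p for s in shards for p in s]
```

Per-rank steady-state memory is roughly 1/world_size of the full parameter set, which is the practical payoff the paper quantifies.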
Open language models from Google (Gemma)
Co-authored Gemma: open models based on Gemini research and technology.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open multimodal models (Gemma 3)
Co-authored the Gemma 3 Technical Report.
Representation learning, deep learning foundations
One of the central figures of the deep-learning revival, known especially for work on distributed representations and for fostering the research culture that produced a generation of modern AI leaders.
Reasoning, verification, math
A useful person to study if you care about alignment proposals that try to make superhuman systems legible enough for humans to supervise in practice.
Vision Transformers (ViT)
Co-authored ViT: a turning point for transformers in vision.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open language models from Google (Gemma)
Co-authored Gemma: open models based on Gemini research and technology.
Compute-optimal scaling for LLM training
A useful page for the long-running DeepMind contributor layer behind large-model training, especially across the Gopher, Chinchilla, and Gemini sequence.
Open language models from Google (Gemma)
Co-authored Gemma: open models based on Gemini research and technology.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Frontier model development (GPT-4)
Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.
Mixture-of-experts LLMs
Co-authored Mixtral of Experts: a key MoE reference in the open-weights frontier.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Vision-language pretraining (CLIP)
Co-authored CLIP: a core reference for contrastive multimodal pretraining.
Open language models (Gemma 2)
Co-authored Gemma 2: improving open language models at a practical size.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Robust speech recognition (Whisper)
Co-authored Whisper: robust speech recognition via large-scale weak supervision.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.
Large-scale language modeling (GPT-3)
Co-authored GPT-3: Language Models are Few-Shot Learners.
Open language models from Google (Gemma)
Co-authored Gemma: open models based on Gemini research and technology.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
Open-model frontier reports (DeepSeek-V3)
Co-authored the DeepSeek-V3 Technical Report.
Streaming + long-context stability (attention sinks)
A strong systems page: his work repeatedly appears where inference efficiency meets usable long context, notably attention sinks, StreamingLLM, post-training quantization, and later long-context head designs.
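The attention-sink observation behind StreamingLLM yields a simple KV-cache eviction rule: keep the first few "sink" positions plus a sliding window of recent positions, and drop everything in between. A toy sketch of that policy (the defaults here are illustrative, not the paper's tuned values):

```python
# StreamingLLM-style KV-cache eviction: retain sink tokens + recent window.

def evict(cache, n_sink=4, window=8):
    """Return the cache entries kept under the attention-sink policy."""
    if len(cache) <= n_sink + window:
        return list(cache)          # nothing to evict yet
    return list(cache[:n_sink]) + list(cache[-window:])
```

The resulting cache is constant-size, which is what makes indefinite streaming generation stable without retraining.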
Open-weight frontier models (Llama 3)
Co-authored “The Llama 3 Herd of Models”.
RWKV and efficient sequence modeling
Worth tracking because he stays with the RWKV line from the original paper through Eagle/Finch, GoldFinch, and RWKV-7; that repeated-authorship signal is exactly what makes these long-tail pages valuable.
Small, capable models (Phi-3)
Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).
Open-model frontier reports (DeepSeek-V3)
Co-authored the DeepSeek-V3 Technical Report.
Open-weight LLMs (Qwen2)
Co-authored the Qwen2 Technical Report.
Mixture-of-experts LLMs
Co-authored Mixtral of Experts: a key MoE reference in the open-weights frontier.
Multimodal frontier models (Gemini)
Co-authored Gemini: A Family of Highly Capable Multimodal Models.