Researchers — page 25

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2904

Sophia Austin

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2905

Sophia Yang

Mixture-of-experts LLMs

Mistral

Co-authored Mixtral of Experts: a key MoE reference in the open-weights frontier.

Open Models Systems & Infrastructure Mixtral of Experts

2906

Sophie Bridgers

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2907

Soravit Changpinyo

Scaled multilingual vision-language models (PaLI)

Multimodal PaLI: A Jointly-Scaled Multilingual Language-Image Model

Co-authored PaLI: a key reference for scaling multilingual vision-language models.

2908

Soumith Chintala

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2909

Soumya Batra

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

2910

Spencer Whitehead

Promptable segmentation foundation models (SAM)

Vision & Robotics Segment Anything

Co-authored Segment Anything.

2911

Spencer Whitman

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2912

Sri Gayatri Sundara Padmanabhan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2913

Srivatsa Kundurthy

Open large-scale image-text data (LAION-5B)

Co-authored LAION-5B: a widely used open dataset for vision-language foundation models.

Multimodal LAION-5B: An open large-scale dataset for training next generation image-text models

2914

Srivatsan Srinivasan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2915

Srividya Pranavi Potharaju

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2916

Stanislav Fort

Alignment via AI feedback (Constitutional AI)

Anthropic

Important because his work sits at a useful junction of robustness, scaling, adversarial attacks, and security-minded analysis of large models rather than staying inside one narrow alignment niche.

Post-Training & Alignment Reinforcement Learning Security & Robustness Stanislav Fort

2917

Stanislaw Wozniak

RWKV and efficient sequence modeling

A worthwhile long-tail page because he appears on both the original RWKV paper and Eagle/Finch and also has visible follow-on work from the same Wrocław group rather than disappearing after the first release.

Open Models Systems & Infrastructure Stanisław Woźniak at Wrocław University of Science and Technology

2918

Stefano Ermon

Direct preference optimization (DPO)

A high-signal researcher for the probabilistic and generative-modeling side of modern AI, and an important bridge into the Stanford preference-optimization cluster that helped make DPO mainstream.

Post-Training & Alignment Systems & Infrastructure Diffusion & Generative Media Stefano Ermon at Stanford

2919

Stella Biderman

Open-source LLMs, datasets

EleutherAI

A key open-model ecosystem builder whose work matters because it combines research, public infrastructure, and field-level coordination rather than isolated paper output alone.

Open Models Systems & Infrastructure The Pile: An 800GB Dataset of Diverse Text for Language Modeling

2920

Sten Sootla

Open foundation models for code (Code Llama)

Open Models Code Models Code Llama: Open Foundation Models for Code

Co-authored Code Llama: a key open-model reference for code generation and coding assistants.

2921

Stephan Lee

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2922

Stephane Collot

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2923

Stephanie Lin

Truthfulness and hallucination evaluation

Co-authored TruthfulQA: an influential benchmark for truthfulness and falsehood mimicry in LMs.

Post-Training & Alignment Evaluation & Benchmarks TruthfulQA: Measuring How Models Mimic Human Falsehoods

2924

Stephanie Max

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2925

Stephanie Winkler

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2926

Stephen Cagle

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2927

Stephen Chen

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2928

Steve Dowling

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

2929

Steve Kehoe

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2930

Steve Li

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2931

Steve Satterfield

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2932

Steve Yadlowsky

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2933

Steven Adler

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

2934

Steven Basart

Broad capability evaluation (MMLU)

Co-authored MMLU: a widely used benchmark for general LLM capability across many subjects.

Evaluation & Benchmarks Measuring Massive Multitask Language Understanding

2935

Steven Hand

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2936

Steven Hansen

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2937

Steven Hoi

Bootstrapped vision-language pretraining (BLIP)

Co-authored BLIP: a high-impact recipe for unified vision-language understanding and generation.

Multimodal BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

2938

Steven Zheng

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2939

Subha Puttagunta

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2940

Subhabrata Das

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2941

Subhajit Naskar

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2942

Subhrajit Roy

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2943

Suchin Gururangan

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2944

Suchir Balaji

Code-focused LLMs and evaluation (Codex)

Evaluation & Benchmarks Code Models Evaluating Large Language Models Trained on Code

Co-authored the Codex evaluation paper: an early anchor for code LLM capability measurement.

2945

Sudarshan Govindaprasad

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2946

Sue Ronstrom

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

2947

Sujeevan Rajayogam

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2948

Sully Chen

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

2949

Sumit Bagri

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2950

Sumit Gupta

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2951

Summer Deng

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2952

Summer Yue

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2953

Sungmin Cho

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2954

Sunipa Dev

Pathways-scale language modeling (PaLM)

PaLM: Scaling Language Modeling with Pathways

Co-authored PaLM: Scaling Language Modeling with Pathways.

2955

Sunny Virk

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2956

Suraj Subramanian

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2957

Suriya Gunasekar

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

2958

Surya Bhupatiraju

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

2959

Surya Ganguli

Holistic evaluation of language models (HELM)

Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

Evaluation & Benchmarks Holistic Evaluation of Language Models

2960

Susan Chan

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

2961

Susan Zhang

Open multimodal models (Gemma 3)

Open Models Multimodal Gemma 3 Technical Report

Co-authored the Gemma 3 Technical Report.

2962

Sushant Kafle

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2963

Sushil Mittal

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2964

Swadheen Shukla

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

2965

Swaroop Mishra

Synthetic instructions for alignment (Self-Instruct)

Co-authored Self-Instruct: a key reference for instruction data generation pipelines.

Post-Training & Alignment Self-Instruct: Aligning Language Models with Self-Generated Instructions

2966

Swayam Singh

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

Open Models Code Models StarCoder: may the source be with you!

2967

Swetha Sankar

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2968

Sy Choudhury

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2969

Sydney Borodinsky

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2970

Sydney Goldman

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2971

Sylvain Gelly

Vision Transformers (ViT)

Co-authored ViT: a turning point for transformers in vision.

Vision & Robotics An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

2972

Sylvain Gugger

Open-source tooling for modern NLP (Transformers library)

Hugging Face

Co-authored the Hugging Face Transformers paper that helped standardize modern NLP workflows.

Open Models Transformers: State-of-the-Art Natural Language Processing

2973

Szymon Antoniak

Mixture-of-experts LLMs

Mistral

Co-authored Mixtral of Experts: a key MoE reference in the open-weights frontier.

Open Models Systems & Infrastructure Mixtral of Experts

2974

Szymon Sidor

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

2975

T. Wang

Open-model frontier reports (DeepSeek-V3)

DeepSeek

Co-authored the DeepSeek-V3 Technical Report.

Open Models DeepSeek-V3 Technical Report

2976

Tabarak Khan

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

2977

Tal Delbari

Hybrid Transformer–Mamba language models (Jamba)

AI21

A useful page because it gives another one of the non-model contributors on Jamba-1.5 a real place in the map; frontier-model launches depend on product and execution work, not just research authorship.

Systems & Infrastructure JAMBA: Hybrid Transformer-Mamba Language Models

2978

Tal Ness

Hybrid Transformer–Mamba language models (Jamba)

AI21

A worthwhile long-tail researcher page because it makes the data-and-evaluation layer of modern language-model work visible instead of treating frontier systems as if they were only architecture or scaling stories.

Evaluation & Benchmarks Systems & Infrastructure Tal Ness

2979

Tal Remez

Open foundation models for code (Code Llama)

Open Models Code Models Code Llama: Open Foundation Models for Code

Co-authored Code Llama: a key open-model reference for code generation and coding assistants.

2980

Tamar Glaser

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2981

Tamar Herman

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2982

Tamara Best

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2983

Tamara von Glehn

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2984

Tamera Lanham

Alignment via AI feedback (Constitutional AI)

Anthropic

A high-signal page for anyone tracking whether model reasoning traces are actually trustworthy, not just fluent explanations pasted on after the fact.

Post-Training & Alignment Agents & Reasoning Reinforcement Learning Measuring Faithfulness in Chain-of-Thought Reasoning

2985

Tanya Grunina

Multimodal frontier models (Gemini)