Researchers — page 23

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

2657

Ryan Pham

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2658

Ryan Sepassi

Pathways-scale language modeling (PaLM)

Co-authored PaLM: Scaling Language Modeling with Pathways.

2659

S. S. Li

Open-model frontier reports (DeepSeek-V3)

DeepSeek

Co-authored the DeepSeek-V3 Technical Report.

Open Models DeepSeek-V3 Technical Report

2660

S. Sara Mahdavi

Text-to-image diffusion with strong language understanding (Imagen)

Diffusion & Generative Media Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Co-authored Imagen: a milestone for photorealistic text-to-image diffusion models.

2661

Saaber Fatehi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2662

Sabela Ramos

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

2663

Sabine Lehmann

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2664

Sachin Mehta

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2665

Sachin Siby

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2666

Saffron Huang

Red teaming with language models

Co-authored Red Teaming LMs with LMs: a concrete approach to stress-testing model behavior at scale.

Post-Training & Alignment Evaluation & Benchmarks Security & Robustness Red Teaming Language Models with Language Models

2667

Saghar Hosseini

Open-weight chat and foundation models (Llama 2)

Open Models Llama 2: Open Foundation and Fine-Tuned Chat Models

Co-authored Llama 2: Open Foundation and Fine-Tuned Chat Models.

2668

Sahana Chennabasappa

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2669

Sahand Sharifzadeh

Few-shot vision-language models (Flamingo)

Multimodal Flamingo: a Visual Language Model for Few-Shot Learning

Co-authored Flamingo: an influential multimodal model for few-shot vision-language tasks.

2670

Sahil Dua

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2671

Sahitya Potluri

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2672

Sai Jayesh Bondu

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2673

Sai Krishnakumaran

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2674

Sainbayar Sukhbaatar

Self-rewarding post-training

Co-authored Self-Rewarding Language Models: explores self-improvement via internal reward modeling.

Post-Training & Alignment Self-Rewarding Language Models

2675

Saining Xie

Masked autoencoders for vision (MAE)

Co-authored MAE: a strong template for scalable self-supervised vision pretraining.

Vision & Robotics Masked Autoencoders Are Scalable Vision Learners

2676

Salem Haykal

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2677

Salvatore Scellato

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2678

Sam Ade Jacobs

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

2679

Sam Altman

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

2680

Sam Manning

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

2681

Sam McCandlish

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Evaluation & Benchmarks Systems & Infrastructure Rahul Patil joins Anthropic as Chief Technology Officer

One of the clearest people to follow if you care about scaling laws, training efficiency, and the systems choices that quietly shape frontier-model progress.

2682

Sam Ringer

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Evaluation & Benchmarks Systems & Infrastructure Discovering Language Model Behaviors with Model-Written Evaluations

Important because he is right at the center of the model-written-evals line, which became one of Anthropic’s clearest attempts to discover behaviors faster than manual evaluation can.

2683

Sam Shleifer

Open-source tooling for modern NLP (Transformers library)

Hugging Face

Co-authored the Hugging Face Transformers paper that helped standardize modern NLP workflows.

Open Models Transformers: State-of-the-Art Natural Language Processing

2684

Sam Sobell

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2685

Samaneh Saadat

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

2686

Sambudha Roy

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

2687

Samer Hassan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2688

Samira Daruki

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2689

Sammy Jerome

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

2690

Samuel Andermatt

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2691

Samuel Arcadinho

RWKV and efficient sequence modeling

Co-authored RWKV: Reinventing RNNs for the Transformer Era.

Open Models Systems & Infrastructure RWKV: Reinventing RNNs for the Transformer Era

2692

Samuel L Smith

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

2693

Samuel R. Bowman

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Evaluation & Benchmarks Agents & Reasoning Constitutional AI: Harmlessness from AI Feedback

A useful page if you care about the harder question of whether a model’s visible chain of reasoning is actually faithful, not just plausible-looking.

2694

Samuel Weinbach

Open-source LLMs (EleutherAI)

EleutherAI

Important if you care about the European sovereign-AI track, especially the attempt to build multilingual, explainable, and compliance-conscious frontier systems outside the US lab stack.

Open Models Evaluation & Benchmarks Systems & Infrastructure Aleph Alpha leadership

2695

Samuel Wolrich

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

2696

Samyak Datta

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2697

Samyam Rajbhandari

Memory-efficient distributed training (ZeRO)

Co-authored ZeRO: foundational memory optimizations for training very large models.

Systems & Infrastructure ZeRO: Memory Optimizations Toward Training Trillion Parameter Models

2698

Sanaz Bahargam

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2699

Sandeep Subramanian

Mixture-of-experts LLMs

Mistral

Co-authored Mixtral of Experts: a key MoE reference in the open-weights frontier.

Open Models Systems & Infrastructure Mixtral of Experts

2700

Sandhini Agarwal

Instruction tuning and RLHF

Post-Training & Alignment Evaluation & Benchmarks Systems & Infrastructure New and improved content moderation tooling

A good person to follow if you care about what deployment-minded safety work looks like inside a frontier lab, especially around moderation, image systems, and system-card style evaluation.

2701

Sandipan Kundu

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Evaluation & Benchmarks Agents & Reasoning Many-shot Jailbreaking

Useful for the attack-and-evaluation side of alignment work, especially long-context jailbreak research and the measurement work that turns safety concerns into concrete tests.

2702

Sang Michael Xie

Holistic evaluation of language models (HELM)

Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

Evaluation & Benchmarks Holistic Evaluation of Language Models

2703

Sanil Jain

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2704

Sanjay Ganapathy

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2705

Sanjay Ghemawat

Pathways-scale language modeling (PaLM)

Co-authored PaLM: Scaling Language Modeling with Pathways.

2706

Sanjay Singh

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2707

Sara Chugh

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2708

Sara Hunt

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2709

Sara Mc Carthy

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

2710

Sara Smoot

Open multimodal models (Gemma 3)

Open Models Multimodal Gemma 3 Technical Report

Co-authored the Gemma 3 Technical Report.

2711

Sarah Cogan

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

2712

Sarah Hodkinson

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2713

Sarah Perrin

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

2714

Sarah Shoker

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

2715

Sarah Yoo

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

2716

Sarah York

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2717

Sargun Dhillon

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2718

Sarmad Hashmi

Open code models (CodeGemma)

Open Models Code Models CodeGemma: Open Code Models Based on Gemma

Co-authored CodeGemma: open code models based on Gemma.

2719

Sarmishta Velury

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2720

Sarthak Jauhari

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2721

Sasan Tavakkol

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2722

Sasha Brown

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2723

Sasha Luccioni

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

Open Models Code Models StarCoder: may the source be with you!

2724

Sasha Sidorov

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2725

Sasha Tsvyashchenko

Pathways-scale language modeling (PaLM)

Co-authored PaLM: Scaling Language Modeling with Pathways.

2726

Sasha Zykova

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2727

Satadru Pan

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2728

Saurabh Kumar

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2729

Saurabh Mahajan

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2730

Saurabh Saxena

Text-to-image diffusion with strong language understanding (Imagen)

Diffusion & Generative Media Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Co-authored Imagen: a milestone for photorealistic text-to-image diffusion models.

2731

Saurabh Shah

Open, fully-documented language models (OLMo)

AI2

Co-authored OLMo: Accelerating the Science of Language Models.

Open Models OLMo: Accelerating the Science of Language Models

2732

Saurabh Verma

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2733

Saurav Kadavath

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Evaluation & Benchmarks Agents & Reasoning Discovering Language Model Behaviors with Model-Written Evaluations

A good person to follow for the part of alignment work that becomes concrete measurement: model-written tests, chain-of-thought faithfulness, and behavior-shaping methods that can actually be audited.

2734

Sayed Hadi Hashemi

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2735

Scott Gray

Text-to-image generation (DALL·E)

Multimodal Diffusion & Generative Media Zero-Shot Text-to-Image Generation

Co-authored the original DALL·E paper: zero-shot text-to-image generation.

2736

Scott Heiner

Model-written evaluations for LM behavior

Post-Training & Alignment Evaluation & Benchmarks Discovering Language Model Behaviors with Model-Written Evaluations

Co-authored model-written evals: a practical technique for discovering and measuring LM behaviors.

2737

Scott Huffman

Open code models (CodeGemma)

Open Models Code Models CodeGemma: Open Code Models Based on Gemma

Co-authored CodeGemma: open code models based on Gemma.

2738

Scott Johnston

Alignment via AI feedback (Constitutional AI)

Post-Training & Alignment Evaluation & Benchmarks Reinforcement Learning Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Useful for the arc from early RLHF assistant work into the later evaluation-heavy safety layer Anthropic built on top of it.

2739

Scott Mayer McKinney

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

2740

Scott Reed

Generalist agents (Gato)

Multimodal Agents & Reasoning A Generalist Agent

Co-authored Gato: a key reference for generalist, multi-task agents.

2741

Sean Bell

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

2742

Sean Hughes

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

Open Models Code Models StarCoder: may the source be with you!

2743

Seb Noury

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

2744

Sebastian Borgeaud

Gemini (multimodal foundation models)

A high-signal researcher for understanding the modern scaling playbook, especially around compute-optimal training, retrieval-augmented language models, and the text side of Gemini-era multimodal systems.

Multimodal Evaluation & Benchmarks Systems & Infrastructure An empirical analysis of compute-optimal large language model training

2745

Sebastian Gehrmann

Pathways-scale language modeling (PaLM)

Co-authored PaLM: Scaling Language Modeling with Pathways.

2746

Sebastian Goodman

Scaled multilingual vision-language models (PaLI)