Researchers — page 5

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

506

Carolyn Jane Anderson

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

Open Models Code Models StarCoder: may the source be with you!

507

Carrie Muir

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

508

Carrie Spadine

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

509

Carroll L. Wainwright

Instruction-following via RLHF (InstructGPT)

Post-Training & Alignment Training language models to follow instructions with human feedback

Co-authored the InstructGPT paper that set the standard instruction-tuning + RLHF recipe.

510

Carroll Wainwright

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

511

Casey Chu

Diffusion-based text-to-image generation (DALL·E 2)

Diffusion & Generative Media Hierarchical Text-Conditional Image Generation with CLIP Latents

Co-authored DALL·E 2: hierarchical text-conditional image generation with CLIP latents.

512

Cassidy Hardin

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

513

Catalina Mejia

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Post-Training & Alignment Reinforcement Learning Interpretability A Mathematical Framework for Transformer Circuits

514

Catherine Olsson

Alignment via AI feedback (Constitutional AI)

Anthropic

One of the clearest people to follow if you want the mechanistic-interpretability thread at Anthropic rather than only its safety-policy surface.

515

Ce Liu

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

516

Ce Zhang

Holistic evaluation of language models (HELM)

Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

Evaluation & Benchmarks Holistic Evaluation of Language Models

517

Ce Zheng

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

518

Chaitanya Krishna Lanka

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

519

Chak Ming Li

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

Chalence Safranek-Shrader

520

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

521

Chang Lan

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

522

Chang Zhou

Open-weight LLMs (Qwen)

Open Models Qwen Technical Report

Co-authored the Qwen Technical Report.

523

Changhan Wang

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

524

Changkyu Kim

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

525

Chao Jia

Scaled multilingual vision-language models (PaLI)

Multimodal PaLI: A Jointly-Scaled Multilingual Language-Image Model

Co-authored PaLI: a key reference for scaling multilingual vision-language models.

526

Chao Zhou

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

527

Charles Chen

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

528

Charles Foster

Open-source LLMs (EleutherAI)

EleutherAI

A useful person to track for the evaluation side of AI risk work, especially where open-model benchmarking meets the question of which measurements are actually trustworthy enough to inform decisions.

Open Models Evaluation & Benchmarks Charles Foster

529

Charles Sutton

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

530

Charlie Chen

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

531

Charlie Deck

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

532

Charline Le Lan

Open language models from Google (Gemma)

Open Models Gemma: Open Models Based on Gemini Research and Technology

Co-authored Gemma: open models based on Gemini research and technology.

533

Charlotte Caucheteux

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

534

Charlotte Smith

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

535

Chaya Nayak

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

536

Che Chang

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

537

Chelsea Carlson

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

Post-Training & Alignment Vision & Robotics Chelsea Finn at Stanford HAI

538

Chelsea Finn

Direct preference optimization (DPO)

One of the clearest people to follow for the overlap between modern robotics, meta-learning, and preference-optimization-era alignment research.

539

Chelsea Voss

Text-to-image generation (DALL·E)

Multimodal Diffusion & Generative Media Zero-Shot Text-to-Image Generation

Co-authored the original DALL·E paper: zero-shot text-to-image generation.

540

Chen Almagor

Hybrid Transformer–Mamba language models (Jamba)

AI21

One of the more useful long-tail AI21 pages because it points to the algorithm leadership behind the company’s hybrid-model releases instead of flattening everyone into the same generic coauthor template.

Systems & Infrastructure Chen Almagor

541

Chen Elkind

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

542

Chen Liang

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

543

Chen Zhou

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

544

Chen Zhu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

545

Chenel Elkind

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

546

Cheng Li

Large-scale transformer inference (DeepSpeed)

Co-authored DeepSpeed Inference: practical inference optimizations for serving large transformer models.

Systems & Infrastructure DeepSpeed Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale

547

Cheng-Chun Lee

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

548

Chengda Lu

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

549

Chenggang Zhao

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

Open Models Code Models StarCoder: may the source be with you!

550

Chenghao Mou

Open code LLMs (StarCoder)

Co-authored StarCoder: a foundational open code model effort (BigCode).

551

Chengpeng Li

Open-weight LLMs (Qwen2)

Open Models Qwen2 Technical Report

Co-authored the Qwen2 Technical Report.

552

Chengqi Deng

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

553

Chengqiang Lu

Open-weight LLMs (Qwen)

Open Models Qwen Technical Report

Co-authored the Qwen Technical Report.

554

Chengyuan Li

Open-weight LLMs (Qwen2)

Open Models Qwen2 Technical Report

Co-authored the Qwen2 Technical Report.

555

Chenjie Gu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

556

Chenkai Kuang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

557

Chenmei Li

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

558

Chenruidong Zhang

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

559

Chenxi Liu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

560

Chenxi Pang

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

561

Chenyu Zhang

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

562

Chester Cho

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

563

Chester Hu

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

564

Chester Kwak

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

565

Chetan Ahuja

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

566

Chetan Tekur

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

567

Chien-Chin Huang

Fully Sharded Data Parallel training (FSDP)

Co-authored PyTorch FSDP: practical lessons for scaling fully-sharded training workloads.

Systems & Infrastructure PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel

568

Chih-Kuan Yeh

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

569

Chih-Wei "Louis" Chen

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

570

Chimezie Iwuanyanwu

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

571

Ching-Hsiang Chu

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

572

Chintu Kumar

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

573

Chitwan Saharia

Text-to-image diffusion with strong language understanding (Imagen)

Diffusion & Generative Media Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Co-authored Imagen: a milestone for photorealistic text-to-image diffusion models.

574

Chloe Bi

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

575

Chloe Rolland

Promptable segmentation foundation models (SAM)

Vision & Robotics Segment Anything

Co-authored Segment Anything.

576

Chloe Thornton

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

577

Chong Luo

Small, capable models (Phi-3)

Co-authored the Phi-3 Technical Report (capable models designed for smaller footprints).

Open Models Systems & Infrastructure Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

578

Chong Ruan

Open-model frontier reports (DeepSeek-V3)

Co-authored the DeepSeek-V3 Technical Report.

579

Chong Zhang

Instruction-following via RLHF (InstructGPT)

Post-Training & Alignment Training language models to follow instructions with human feedback

Co-authored the InstructGPT paper that set the standard instruction-tuning + RLHF recipe.

580

Chris Alberti

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

581

Chris Bamford

Mixture-of-experts LLMs

Mistral

Co-authored Mixtral of Experts: a key MoE reference in the open-weights frontier.

Open Models Systems & Infrastructure Mixtral of Experts

582

Chris Cai

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

583

Chris Gorgolewski

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

584

Chris Hallacy

Vision-language pretraining (CLIP)

Multimodal Learning Transferable Visual Models From Natural Language Supervision

Co-authored CLIP: a core reference for contrastive multimodal pretraining.

585

Chris Hesse

Frontier model development (GPT-4)

Co-authored the GPT-4 Technical Report: a key reference for the GPT-4-era frontier.

586

Chris Hidey

Multimodal frontier models (Gemini)

Multimodal Gemini: A Family of Highly Capable Multimodal Models

Co-authored Gemini: A Family of Highly Capable Multimodal Models.

587

Chris Marra

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

588

Chris McConnell

Open-weight frontier models (Llama 3)

Co-authored “The Llama 3 Herd of Models”.

Post-Training & Alignment Reinforcement Learning Interpretability Feature Visualization

589

Chris Olah

Mechanistic interpretability, visualization

One of the clearest interpreters of neural-network internals, especially in the line of work that turned interpretability into a concrete research agenda rather than a vague aspiration.

590

Chris Perry

Open language models (Gemma 2)

Open Models Gemma 2: Improving Open Language Models at a Practical Size

Co-authored Gemma 2: improving open language models at a practical size.

591

Chris Tindal

Open-weight frontier models (Llama 3)