Qingyang Wu

Visual instruction tuning (LLaVA)

Co-authored "Visual Instruction Tuning", a widely cited recipe for building LLaVA-style multimodal assistants.

Highlights

LLaVA, Multimodal, Vision-language, Open-source
Focus: Visual instruction tuning (LLaVA)
Why it matters: Co-authored "Visual Instruction Tuning", a widely cited recipe for building LLaVA-style multimodal assistants.

Research Areas

LLaVA, Multimodal, Vision-language, Open-source