Back to researchers

Wei-Lin Chiang

Human preference evaluation at scale (Chatbot Arena)

Co-authored Chatbot Arena: a high-impact human-preference evaluation platform for LLMs.

Highlights

EvaluationLMSysBenchmarks
Focus: Human preference evaluation at scale (Chatbot Arena)
Why it matters: Co-authored Chatbot Arena: a high-impact human-preference evaluation platform for LLMs.

Research Areas

EvaluationLMSysBenchmarks
Wei-Lin Chiang - AI Researcher Profile | 500AI