
Michael Jordan

Human preference evaluation at scale (Chatbot Arena)

Co-authored Chatbot Arena, a high-impact human-preference evaluation platform for LLMs.

Highlights

Evaluation, LMSys, Benchmarks
Focus: Human preference evaluation at scale (Chatbot Arena)
Why it matters: Co-authored Chatbot Arena, a high-impact human-preference evaluation platform for LLMs.

Research Areas

Evaluation, LMSys, Benchmarks