Michael Jordan
Human preference evaluation at scale (Chatbot Arena)
Co-authored Chatbot Arena: a high-impact human-preference evaluation platform for LLMs.
Highlights
Evaluation, LMSys, Benchmarks
Focus: Human preference evaluation at scale (Chatbot Arena)
Why it matters: Co-authored Chatbot Arena, a high-impact human-preference evaluation platform for LLMs.
Research Areas
Evaluation, LMSys, Benchmarks