Yonghao Zhuang

LLM-as-a-judge evaluation (MT-Bench)

Co-authored MT-Bench and the LLM-as-a-judge approach, a widely used template for scalable multi-turn evaluation.

Highlights

Evaluation, LMSys, LLM-as-a-judge
Focus: LLM-as-a-judge evaluation (MT-Bench)
Why it matters: Co-authored MT-Bench and the LLM-as-a-judge approach, a widely used template for scalable multi-turn evaluation.

Research Areas

Evaluation, LMSys, LLM-as-a-judge