Back to researchers

Yifan Mai

Holistic evaluation of language models (HELM)

Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

Highlights

EvaluationBenchmarksResponsible AI
Focus: Holistic evaluation of language models (HELM)
Why it matters: Co-authored HELM: a framework for evaluating language models across many axes beyond raw accuracy.

Research Areas

EvaluationBenchmarksResponsible AI
Yifan Mai - AI Researcher Profile | 500AI