Back to researchers
Owain Evans
Truthfulness and hallucination evaluation
Co-authored TruthfulQA: an influential benchmark for truthfulness and falsehood mimicry in LMs.
Highlights
EvaluationTruthfulnessSafety
Focus: Truthfulness and hallucination evaluation
Why it matters: Co-authored TruthfulQA: an influential benchmark for truthfulness and falsehood mimicry in LMs.
Research Areas
EvaluationTruthfulnessSafety