Back to Publications

EMNLP 2024 2024

Humans or LLMs as the Judge? A Study on Judgement Biases

Guiming Hardy Chen*, Shunian Chen*, Ziche Liu, Feng Jiang, Benyou Wang

Abstract

A comprehensive study investigating biases in LLM-based evaluation compared to human judgment.

Resources