Back to Publications
EMNLP 2024 2024
Humans or LLMs as the Judge? A Study on Judgement Biases
Guiming Hardy Chen*, Shunian Chen*, Ziche Liu, Feng Jiang, Benyou Wang
Abstract
A comprehensive study investigating biases in LLM-based evaluation compared to human judgment.