Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges
Under review, NeurIPS, 2024
A comprehensive study of the LLM-as-a-judge paradigm in a controlled setup that reveals new results about its strengths and weaknesses.