Evaluating LLMs at Detecting Errors in LLM Responses
Ryo Kamoi,
Sarkar Snigdha Sarathi Das,
Renze Lou,
Jihyun Janice Ahn,
Yilun Zhao,
Xiaoxin Lu,
Nan Zhang,
Yusen Zhang,
Ranran Haoran Zhang,
Sujeeth Reddy Vummanthala,
Salika Dave,
Shaobo Qin,
Arman Cohan,
Wenpeng Yin,
Rui Zhang
April, 2024