BriefGPT.xyz
Apr, 2023
ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness
Archiki Prasad, Swarnadeep Saha, Xiang Zhou, Mohit Bansal
TL;DR
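Using information-theoretic measures and natural language inference models, the ReCEval framework treats correctness and informativeness as the key properties of a reasoning chain. It evaluates the validity and quality of reasoning chains by viewing them as proofs that derive the final answer, with the aim of improving performance on reasoning tasks.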
Abstract
Multi-step reasoning ability is fundamental to many natural language tasks, yet it is unclear what constitutes a good reasoning chain and how to evaluate them. Most existing methods focus solely on whether the re