BriefGPT.xyz
Apr, 2025
超越最终答案:你的推理轨迹揭示了更多的内容
Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think
HTML
PDF
Hasan Abed Al Kader Hammoud, Hani Itani, Bernard Ghanem
TL;DR
本研究针对大语言模型在复杂问题求解中对最终答案的依赖,提出了质疑。通过分析中间推理步骤(子思维)并基于此提出了一种方法,表明聚合多条推理路径生成的答案,通常能显著提高准确性。实验证明,该方法在多个大语言模型和具有挑战性的数学推理数据集上提升了回答的准确率。
Abstract
Large Language Models
(LLMs) leverage step-by-step reasoning to solve complex problems. Standard evaluation practice involves generating a complete
Reasoning Trace
and assessing the correctness of the final answe
→