BriefGPT.xyz
Jan, 2025
理解、解决与翻译:弥合多语言数学推理差距
Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap
HTML
PDF
Hyunwoo Ko, Guijin Son, Dasol Choi
TL;DR
本研究针对语言模型在非高资源语言(如韩语)上的推理性能差距,提出了HRM8K基准测试。通过分析,我们发现此差距主要源于模型对非英语输入的理解困难,而非推理能力的限制。基于此,提出UST方法,通过在英文基础上进行推理和解决,显著提升了模型表现,并有效减少了多语言的性能差距。
Abstract
Large
Language Models
(LLMs) demonstrate exceptional performance on complex reasoning tasks. However, despite their strong reasoning capabilities in high-resource languages (e.g., English and Chinese), a significant
Per
→