BriefGPT.xyz
May, 2025
通过测试时扩展进行跨语言推理
Crosslingual Reasoning through Test-Time Scaling
HTML
PDF
Zheng-Xin Yong, M. Farid Adilazuarda, Jonibek Mansurov, Ruochen Zhang, Niklas Muennighoff...
TL;DR
本研究解决了大语言模型在多语言推理能力上的研究空白,尤其是英语推理模型在训练后如何在其他语言上表现。通过扩展推理计算,发现英语推理模型可以在多种语言中实现跨语言数学推理的提升,尤其是在高资源语言中更为有效。这项研究揭示了跨语言推理的潜力和机制,同时指出了低资源语言和不同领域知识推理的局限性。
Abstract
Reasoning
capabilities of
Large Language Models
are primarily studied for English, even when pretrained models are multilingual. In this work, we investigate to what extent English
→