BriefGPT.xyz
Oct, 2024
词的思考:提升大型语言模型推理能力
ToW: Thoughts of Words Improve Reasoning in Large Language Models
HTML
PDF
Zhikun Xu, Ming Shen, Jacob Dineen, Zhaonan Li, Xiao Ye...
TL;DR
本文提出了一种名为“词的思考”(ToW)的新型数据增强方法,旨在解决现有下一词预测学习方案的事实幻觉和效率低下问题。通过从大型模型中提取ToW注释,在仅使用70K ToW注释的情况下,模型的推理能力提高了7%至9%,同时减少了高达10%的幻觉现象,展示了显著的潜在影响。
Abstract
We introduce thoughts of words (ToW), a novel training-time data-augmentation method for
Next-word Prediction
. ToW views
Next-word Prediction
as a core
→