BriefGPT.xyz
Nov, 2023
ArthModel: 提升大型语言模型的算术技能
ArthModel: Enhance Arithmetic Skills to Large Language Model
HTML
PDF
Yingdi Guo
TL;DR
通过训练语言模型生成与算数问题相关的后缀表达式,并将其与小型预训练模型结合,将令牌嵌入转化为真实稠密数字,通过深度学习平台的本地函数获得正确答案。最后,通过提示注入将小型模型的结果输出添加到语言模型中,从而增强了语言模型的算数能力。
Abstract
With the great success of ChatGPT, the research of large
language models
has become increasingly popular. However, the models have several limitations, such as
toxicity
and pool performance of arithmetic solving.
→