BriefGPT.xyz
Aug, 2023
教小型语言模型如何推广到未见过的组合问题
Teaching Smaller Language Models To Generalise To Unseen Compositional Questions
HTML
PDF
Tim Hartill, Neset TAN, Michael Witbrock, Patricia J. Riddle
TL;DR
我们在本文中提出了一种通过多任务监督预训练和密集检索系统的组合来实现对具有挑战性的复合问题的泛化的方法,并且展示了通过添加用于训练的检索增强数据集可以显著提高模型的性能。
Abstract
We equip a smaller
language model
to generalise to answering challenging compositional questions that have not been seen in training. To do so we propose a combination of
multitask supervised pretraining
on up to
→