BriefGPT.xyz
Apr, 2023
LongForm: Optimizing Instruction Tuning for Long Text Generation with Corpus Extraction
Abdullatif Köksal, Timo Schick, Anna Korhonen, Hinrich Schütze
TL;DR
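Instruction tuning on the LongForm dataset can improve the generalization of language models. The dataset is built by taking a diverse set of human-written documents and using LLMs to generate a matching instruction for each one. It supports long text generation, and models tuned on it perform well on tasks such as text generation and multilingual instruction recognition.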
Abstract
Instruction tuning enables language models to generalize more effectively and better follow user intent. However, obtaining instruction data can be costly and challenging. Prior works employ methods such as expen