BriefGPT.xyz
Oct, 2021
文本摘要模型的训练动态
Training Dynamics for Text Summarization Models
HTML
PDF
Tanya Goyal, Jiacheng Xu, Junyi Jessy Li, Greg Durrett
TL;DR
本文分析生成模型的训练动态,特别是聚焦于总结的方面,并研究了不同阶段的训练过程中模型学到的东西,通过简单的训练修正可以实现不同目标,比如提高事实性和提高抽象程度。
Abstract
pre-trained language models
(e.g. BART) have shown impressive results when fine-tuned on large
summarization
datasets. However, little is understood about this
→