BriefGPT.xyz
Dec, 2022
语言生成模型的自然偏好
A Natural Bias for Language Generation Models
HTML
PDF
Clara Meister, Wojciech Stokowiec, Tiago Pimentel, Lei Yu, Laura Rimell...
TL;DR
本文提出了一种以unigram分布为先验知识的初始化模型权重的方法,在神经语言生成模型中应用该方法可提高学习效率、整体性能以及鼓励模型专注于非频率相关的语言特性。
Abstract
After just a few hundred training updates, a standard
probabilistic model
for
language generation
has likely not yet learnt many semantic or syntactic rules of natural language, which inherently makes it difficul
→