BriefGPT.xyz
Oct, 2021
GNN-LM: Language Modeling based on Global Contexts via GNN
Yuxian Meng, Shi Zong, Xiaoya Li, Xiaofei Sun, Tianwei Zhang...
TL;DR
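GNN-LM extends the vanilla neural language model (LM) by building a directed heterogeneous graph that links the input context to related contexts drawn from the corpus, and then running a GNN over that graph. This gives the model direct access to reference contexts and improves its generalization. The approach outperforms existing methods on several datasets, for example reaching a best perplexity of about 14.8 on WikiText-103.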
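The idea in the summary above can be illustrated with a toy sketch: pick the corpus contexts most similar to the input, connect them to the input node with directed edges, and let the input node aggregate information from them. This is not the authors' implementation. The function names (`retrieve_neighbors`, `build_graph`, `aggregate`), the cosine-similarity retrieval, the two node types, and the single mean-aggregation step are all assumptions made for illustration; a real GNN layer would use learned parameters.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def retrieve_neighbors(query, corpus_vectors, k):
    """Return indices of the k corpus contexts most similar to the query (assumed retrieval rule)."""
    ranked = sorted(
        range(len(corpus_vectors)),
        key=lambda i: cosine(query, corpus_vectors[i]),
        reverse=True,
    )
    return ranked[:k]


def build_graph(query, corpus_vectors, k):
    """Build a small directed graph with two node types: 'input' and 'retrieved'."""
    neighbor_ids = retrieve_neighbors(query, corpus_vectors, k)
    nodes = [("input", query)]
    nodes += [("retrieved", corpus_vectors[i]) for i in neighbor_ids]
    # Each edge points from a retrieved node (index 1..k) to the input node (index 0).
    edges = [(src, 0) for src in range(1, len(nodes))]
    return nodes, edges


def aggregate(nodes, edges, target=0):
    """One unlearned message-passing step: average the target's vector
    with the mean of the vectors on its incoming edges."""
    own = nodes[target][1]
    incoming = [nodes[src][1] for src, dst in edges if dst == target]
    if not incoming:
        return list(own)
    dim = len(own)
    mean = [sum(vec[d] for vec in incoming) / len(incoming) for d in range(dim)]
    return [(own[d] + mean[d]) / 2 for d in range(dim)]


if __name__ == "__main__":
    corpus = [[1.0, 0.0], [0.0, 1.0], [0.8, 0.6]]  # hypothetical context embeddings
    nodes, edges = build_graph([1.0, 0.0], corpus, k=2)
    print(edges)
    print(aggregate(nodes, edges))
```

In this sketch the input node's updated representation mixes its own embedding with those of its nearest corpus contexts, which is the sense in which the model can "reference" similar contexts rather than rely only on what its parameters have memorized.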
Abstract
Inspired by the notion that "to copy is easier than to memorize", in this work, we introduce GNN-LM, which extends the vanilla neural language model (LM) by allowing to reference similar contexts in the e...