BriefGPT.xyz
Apr, 2020
基于实体监督的稀疏记忆访问
Entities as Experts: Sparse Memory Access with Entity Supervision
HTML
PDF
Thibault Févry, Livio Baldini Soares, Nicholas FitzGerald, Eunsol Choi, Tom Kwiatkowski
TL;DR
使用实体作为专家的具有记忆分离能力的新模型(EAE),能够捕获文本中实体的声明性知识,比具有10倍参数的编码-生成变压器模型的性能更好,并比类似大小的BERT和以前整合外部实体知识的方法具有更多的事实知识。
Abstract
We focus on the problem of capturing declarative knowledge in the learned parameters of a
language model
. We introduce a new model,
entities as experts
(EaE), that can access distinct memories of the entities men
→