Mogrifier LSTM

Sep, 2019

Gábor Melis, Tomáš Kočiský, Phil Blunsom

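TL;DR: This paper introduces a mutual gating mechanism for the Long Short-Term Memory (LSTM) network to better model the interaction between inputs and their context in natural language processing. Experiments on multiple datasets show that it generalizes better and performs better at language modeling than conventional models.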

Abstract

Many advances in natural language processing have been based upon more expressive models for how inputs interact with the context in which they occur. Recurrent networks, which have enjoyed a modicum of success, still lack the →