BriefGPT.xyz
Nov, 2018
评估LSTM模型在形式语言中的泛化能力
On Evaluating the Generalization of LSTM Models in Formal Languages
HTML
PDF
Mirac Suzgun, Yonatan Belinkov, Stuart M. Shieber
TL;DR
本研究对长短期记忆网络的归纳学习能力进行了实证评估,发现在不同的训练设置下模型性能存在显著差异,并强调在提出神经网络模型的学习能力时需要进行仔细的分析和评估。
Abstract
recurrent neural networks
(RNNs) are theoretically Turing-complete and established themselves as a dominant model for
language processing
. Yet, there still remains an uncertainty regarding their language learning
→