Mar, 2015
Inferring Algorithmic Patterns with Stack-Augmented Recurrent Nets
Armand Joulin, Tomas Mikolov
TL;DR
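This paper discusses the limitations of standard deep learning approaches and shows that some of them can be overcome by increasing model complexity in a structured way. Specifically, it studies the simplest sequence prediction problems on algorithmically generated sequences, which can only be solved by models able to count and to memorize sequences. It shows that some basic algorithms can be learned from sequence data using a recurrent network paired with a trainable memory.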
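To make the counting requirement in the TL;DR concrete, here is a minimal sketch of one algorithmically generated sequence family, a^n b^n, together with a hand-written predictor that uses an explicit counter. This is an illustration only, not the authors' model: the function names and the `"<end>"` marker are assumptions introduced here, and the counter is hard-coded, whereas the paper's network is meant to learn such behavior from data through its trainable memory.

```python
def generate_anbn(n):
    """Return the sequence a^n b^n as a list of symbols (illustrative task)."""
    return ["a"] * n + ["b"] * n


def predict_with_counter(sequence):
    """Predict the next symbol at each position using an explicit counter.

    While reading 'a' symbols the next symbol is not determined (it could be
    another 'a' or the first 'b'), so the prediction is None. Once the 'b'
    symbols start, the counter fixes exactly how many remain.
    "<end>" is a hypothetical end-of-sequence marker used only in this sketch.
    """
    count = 0
    predictions = []
    for symbol in sequence:
        if symbol == "a":
            count += 1  # push: one more 'b' is owed
            predictions.append(None)
        else:
            count -= 1  # pop: one owed 'b' has been emitted
            predictions.append("b" if count > 0 else "<end>")
    return predictions


if __name__ == "__main__":
    seq = generate_anbn(3)
    print(seq)
    print(predict_with_counter(seq))
```

The point of the sketch is that the deterministic part of the sequence (the run of `b` symbols) can only be predicted by a model that keeps track of how many `a` symbols it has seen, which is the kind of capability the summary attributes to memory-augmented recurrent networks.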
Abstract
While machine learning is currently very successful in several application domains, we are still very far from a real artificial intelligence. In this paper, we study basic sequence prediction problems that are b