BriefGPT.xyz
Nov, 2017
Variational Bi-LSTMs
Samira Shabanian, Devansh Arpit, Adam Trischler, Yoshua Bengio
TL;DR
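This paper introduces the Variational Bi-LSTM, a variant of the bidirectional long short-term memory (bidirectional LSTM) architecture. It creates a channel between the forward and backward paths during training (the channel may be omitted at inference), so that the two paths are optimized jointly and can exchange distinct information, and it shows strong predictive performance across the reported benchmarks.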
Abstract
Recurrent neural networks like long short-term memory (LSTM) are important architectures for sequential prediction tasks. LSTMs (and RNNs in general) model sequences along the forward time direction.
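To make the forward-versus-bidirectional distinction concrete, here is a minimal sketch in plain Python. It is an illustration under loud assumptions, not the paper's model: it uses a scalar tanh recurrence instead of an LSTM cell, the weights `w_in` and `w_rec` and the helper names `run_rnn` and `run_bidirectional` are hypothetical, and it does not include the Variational Bi-LSTM channel between the two paths.

```python
import math

def run_rnn(xs, w_in=0.5, w_rec=0.8):
    """Scalar tanh recurrence (a stand-in for an LSTM cell):
    h_t = tanh(w_in * x_t + w_rec * h_{t-1}), with h_0 = 0."""
    h, states = 0.0, []
    for x in xs:
        h = math.tanh(w_in * x + w_rec * h)
        states.append(h)
    return states

def run_bidirectional(xs):
    """Pair each step's forward state (depends on x_1..x_t)
    with its backward state (depends on x_t..x_T)."""
    forward = run_rnn(xs)
    # Run the same recurrence on the reversed input, then flip the
    # states back so index t lines up with the forward pass.
    backward = run_rnn(xs[::-1])[::-1]
    return list(zip(forward, backward))

seq = [1.0, 0.0, 0.0, 0.0]
fwd = run_rnn(seq)
bi = run_bidirectional(seq)
```

In this sketch the forward state at the first step depends only on the first input, while the backward state at the same step depends on the whole sequence. That extra context is what a bidirectional model adds at every position.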