BriefGPT.xyz
Jul, 2015
使用基于注意力机制的编码-解码网络描述多媒体内容
Describing Multimedia Content using Attention-based Encoder--Decoder Networks
HTML
PDF
Kyunghyun Cho, Aaron Courville, Yoshua Bengio
TL;DR
本文介绍了使用深度神经网络中的递归神经网络和卷积神经网络等措施来实现注意力机制,以解决多元随机变量富含联合分布的结构化输出问题。通过典型任务如机器翻译、图像字幕生成、视频片段描述和语音识别实验,系统展示了很好的性能。
Abstract
Whereas
deep neural networks
were first mostly used for classification tasks, they are rapidly expanding in the realm of
structured output problems
, where the observed target is composed of multiple random variab
→