May, 2019
Memory-Attended Recurrent Network for Video Captioning
Wenjie Pei, Jiyuan Zhang, Xiangrong Wang, Lei Ke, Xiaoyong Shen...
TL;DR
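The paper proposes a memory-attended recurrent network for video captioning. The model explores the full-spectrum correspondence between a word and its various similar visual contexts across the training data, which gives a more comprehensive understanding of each word and improves captioning quality.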
Abstract
Typical techniques for video captioning follow the encoder-decoder framework, which can only focus on one source video being processed. A potential disadvantage of such design is that it cannot capture the multip