BriefGPT.xyz
Jul, 2018
Recurrent Fusion Network for Image Captioning
Wenhao Jiang, Lin Ma, Yu-Gang Jiang, Wei Liu, Tong Zhang
TL;DR
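This paper proposes a recurrent fusion network (RFNet) that uses multiple encoders for image captioning. RFNet exploits the interactions among the outputs of the encoders to generate new, compact, and informative representations. Validation experiments show that RFNet is effective for image captioning and achieves new state-of-the-art results.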
Abstract
Recently, much advance has been made in image captioning, and an encoder-decoder framework has been adopted by all the state-of-the-art models. Under this framework, an input image is encoded by a …