BriefGPT.xyz
March 2018
Neural Baby Talk
Jiasen Lu, Jianwei Yang, Dhruv Batra, Devi Parikh
TL;DR
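This paper proposes a novel image captioning model that generates natural-language descriptions while grounding them in entities detected in the image. The model first generates a sentence template whose slots are explicitly linked to image regions, then fills those slots with the visual concepts detected in those regions. The resulting framework is end-to-end differentiable and achieves state-of-the-art results on both standard image captioning and novel object captioning.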
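The template-then-fill idea can be illustrated with a small toy sketch. This is not the authors' model, which is neural and trained end to end; it only shows the data flow under simplifying assumptions. The names `Detection` and `fill_template`, the use of integers as slot markers, and the example detections are all hypothetical and chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    """A hypothetical object-detector output for one image region."""
    region_id: int
    label: str
    score: float


def fill_template(template, detections):
    """Fill region-linked slots in a sentence template.

    Assumption: a template is a list whose items are either plain words
    (str) or slot markers (int) that point at a detected region.
    """
    by_region = {d.region_id: d for d in detections}
    words = []
    for token in template:
        if isinstance(token, int):
            # Slot: substitute the concept detected in the linked region.
            words.append(by_region[token].label)
        else:
            # Ordinary word produced by the language model.
            words.append(token)
    return " ".join(words)


# Toy template with two slots tied to regions 0 and 1.
template = ["a", 0, "sitting", "next", "to", "a", 1]
detections = [
    Detection(region_id=0, label="dog", score=0.97),
    Detection(region_id=1, label="bicycle", score=0.88),
]
caption = fill_template(template, detections)
print(caption)
```

Because each slot keeps its region index, every visual word in the caption can be traced back to a specific image region, which is the sense in which the caption is grounded.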
Abstract
We introduce a novel framework for image captioning that can produce natural language explicitly grounded in entities that object detectors find in the image. Our approach reconciles classical slot filling approa