BriefGPT.xyz
Mar, 2019
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment
Samyak Datta, Karan Sikka, Anirban Roy, Karuna Ahuja, Devi Parikh...
TL;DR
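This work studies the problem of grounding free-form textual phrases using weak supervision from image-caption pairs. It proposes a novel end-to-end model that uses caption-to-image retrieval as a "downstream" task to guide the phrase localization process.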
Abstract
We address the problem of grounding free-form textual phrases by using weak supervision from image-caption pairs. We propose a novel end-t