BriefGPT.xyz
Jun, 2020
MAGNet:自然语言查询短语级别多区域注意力引导定位
MAGNet: Multi-Region Attention-Assisted Grounding of Natural Language Queries at Phrase Level
HTML
PDF
Amar Shrestha, Krittaphat Pugdeethosapol, Haowen Fang, Qinru Qiu
TL;DR
利用空间注意力网络实现图像级视觉-文本融合,结合本地(单词)和全局(短语)信息实现区域建议,将其应用于短语查询并利用MAGNet模型在ReferIt游戏数据集上取得了超过12%的性能提升。
Abstract
grounding
free-form textual queries necessitates an understanding of these textual phrases and its relation to the visual cues to reliably reason about the described locations.
spatial attention networks
are know
→