BriefGPT.xyz
Jan, 2018
MAttNet: 模块化注意力网络用于指代表达理解
MAttNet: Modular Attention Network for Referring Expression Comprehension
HTML
PDF
Licheng Yu, Zhe Lin, Xiaohui Shen, Jimei Yang, Xin Lu...
TL;DR
本文提出了一种通过使用模块化组件和多种注意力机制,实现对自然语言描述的图像区域定位的方法,该方法在特征抽象、指向性和篮球场景等任务中都优于以往最先进的模型。
Abstract
In this paper, we address
referring expression comprehension
: localizing an image region described by a natural language expression. While most recent work treats expressions as a single unit, we propose to decompose them into three
→