BriefGPT.xyz
Apr, 2016
面向视觉问答的聚焦动态注意力模型
A Focused Dynamic Attention Model for Visual Question Answering
HTML
PDF
Ilija Ilievski, Shuicheng Yan, Jiashi Feng
TL;DR
本文提出了一种基于 Focused Dynamic Attention 模型的视觉问答方法,该方法通过结合全局特征和重点区域信息,能够更好地处理细粒度信息和语言语义,进而提高了视觉问答的表现。
Abstract
visual question and answering
(VQA) problems are attracting increasing interest from multiple research disciplines. Solving VQA problems requires techniques from both
computer vision
for understanding the visual
→