BriefGPT.xyz
Jun, 2019
Deep Modular Co-Attention Networks for Visual Question Answering
Zhou Yu, Jun Yu, Yuhao Cui, Dacheng Tao, Qi Tian
TL;DR
This paper proposes a deep Modular Co-Attention Network model that effectively handles co-attention in Visual Question Answering, and evaluation shows that it significantly outperforms other methods.
Abstract
Visual question answering (VQA) requires a fine-grained and simultaneous understanding of both the visual content of images and the textual content of questions. Therefore, designing an effective 'co-attention' m…