BriefGPT.xyz
Oct, 2020
层次化条件关系网络用于多模态视频问答
Hierarchical Conditional Relation Networks for Multimodal Video Question Answering
HTML
PDF
Thao Minh Le, Vuong Le, Svetha Venkatesh, Truyen Tran
TL;DR
该论文主要介绍了一种基于条件计算结构的一般性可重用神经元CRN和视频QA中的分层条件关系网络HCRN,旨在解决视频问题答案推理的问题。并在广泛的真实世界数据集上展示了其优越性能。
Abstract
video qa
challenges modelers in multiple fronts. Modeling video necessitates building not only spatio-temporal models for the dynamic visual channel but also
multimodal structures
for associated information chann
→