学习用于视觉问答的稀疏专家混合模型

Sep, 2019

学习用于视觉问答的稀疏专家混合模型

Learning Sparse Mixture of Experts for Visual Question Answering

Vardaan Pahuja, Jie Fu, Christopher J. Pal

TL;DR本文提出了一种模块化的神经架构，特别针对 VQA 任务中的卷积神经网络模块，通过网络的稀疏性提高了模型的运行效率，实验表明其可与传统的 CNN VQA 模型相媲美。

Abstract

There has been a rapid progress in the task of visual question answering with improved model architectures. Unfortunately, these models are usually computationally intensive due to their sheer size which poses a serious challenge for deployment. We aim to tackle this issue for the spec