Kai Shen, Lingfei Wu, Siliang Tang, Fangli Xu, Bo Long...
TL;DR利用双提示-答案和视觉兴趣区域,以及动态图和图序列模型进行视觉问题生成的研究。
Abstract
The visual question generation (VQG) task aims to generate human-like questions from an image and potentially other side information (e.g. answer type). Previous works on VQG fall in two aspects: i) They suffer from one image to many questions mapping problem, which leads to the failur