TL;DR本文提出了一种利用 GAN 进行视觉数据合成的框架来解决零样本视频分类问题,并结合多级语义推理和匹配感知的互信息相关来提高合成视频特征的鉴别能力,实验结果表明这种方法可以显著提高零样本视频分类的性能。
Abstract
zero-shot learning (ZSL) in video classification is a promising research
direction, which aims to tackle the challenge from explosive growth of video
categories. Most existing methods exploit seen-to-unseen corre