视觉语义角色标记

May, 2015

Visual Semantic Role Labeling

Saurabh Gupta, Jitendra Malik

TL;DR本文介绍了视觉语义角色标注的问题，即在给定图像的情况下，我们希望检测人们进行的动作并定位交互对象，为了实现这个目标，我们注释了一组数据集，并提供了一组基准算法来解决这个问题，并分析了错误模式，为未来的工作提供了方向。

Abstract

In this paper we introduce the problem of visual semantic role labeling: given an image we want to detect people doing actions and localiz