Haldun Balim, Seonwook Park, Xi Wang, Xucong Zhang, Otmar Hilliges
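TL;DR: This paper proposes a frame-based network that directly predicts the 3D gaze origin and the 3D gaze direction, achieving comparable results on three public gaze datasets.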
Abstract
Despite the recent development of learning-based gaze estimation methods,
most methods require one or more eye or face region crops as inputs and produce
a gaze direction vector as output. Cropping results in a h