Apr, 2018
End-to-End Learning of Motion Representation for Video Understanding
Lijie Fan, Wenbing Huang, Chuang Gan, Stefano Ermon, Boqing Gong...
TL;DR
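The paper proposes TVNet, a novel end-to-end trainable neural network that learns optical-flow-like features from data. End-to-end training can further fine-tune TVNet's parameters to learn richer, task-specific patterns. Experiments show that the method is more accurate than all compared methods on action recognition, while its feature extraction time is on par with the fastest competing method.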
Abstract
Despite the recent success of end-to-end learned representations, hand-crafted optical flow features are still widely used in video analysis tasks. To fill this gap, we propose...