BriefGPT.xyz
Apr, 2021
可组合增强编码用于视频表示学习
Composable Augmentation Encoding for Video Representation Learning
HTML
PDF
Chen Sun, Arsha Nagrani, Yonglong Tian, Cordelia Schmid
TL;DR
研究自监督视频表示学习中的对比方法,提出一种考虑数据增强变量的对比学习框架,以提高针对时间信息进行的微粒视频动作识别的性能,并在多个视频基准测试中达到最先进水平。
Abstract
We focus on
contrastive methods
for self-supervised
video representation
learning. A common paradigm in contrastive learning is to construct positive pairs by sampling different data views for the same instance,
→