BriefGPT.xyz
Aug, 2021
UniCon: 统一的上下文网络用于强韧的活动说话人检测
UniCon: Unified Context Network for Robust Active Speaker Detection
HTML
PDF
Yuanhang Zhang, Susan Liang, Shuang Yang, Xiao Liu, Zhongqin Wu...
TL;DR
提出了一种新的有效框架 UniCon,用于鲁棒的活动演讲者检测,其聚焦于联合建模多种类型的情境信息,包括与候选者之间的视觉关系,以及音频和视觉的关系,并通过聚合长期信息,进一步提高检测效果。
Abstract
We introduce a new efficient framework, the
unified context network
(UniCon), for
robust
active speaker detection
(ASD). Traditional metho
→