BriefGPT.xyz
Aug, 2020
基于深度学习的音视频语音增强和分离概述
An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
HTML
PDF
Daniel Michelsanti, Zheng-Hua Tan, Shi-Xiong Zhang, Yong Xu, Meng Yu...
TL;DR
本文系统综述了基于深度学习的音视频语音增强和分离技术,特别关注了声学和视觉特征、深度学习方法、融合技术以及训练目标和目标函数。同时,还回顾了基于深度学习的无声视频语音重建和语音信号分离的常见方法,并介绍了常用的音视频数据集和评估方法。
Abstract
speech enhancement
and
speech separation
are two related tasks, whose purpose is to extract either one or more target speech signals, respectively, from a mixture of sounds generated by several sources. Tradition
→