BriefGPT.xyz
Jan, 2020
多任务自监督学习用于鲁棒语音识别
Multi-task self-supervised learning for Robust Speech Recognition
HTML
PDF
Mirco Ravanelli, Jianyuan Zhong, Santiago Pascual, Pawel Swietojanski, Joao Monteiro...
TL;DR
本文介绍PASE+,未标注音频表示学习的无偏见语音编码器的改进版本,使用在线语音失真模块,改进编码器和自监督方法,用于在嘈杂和混响的环境中进行鲁棒语音识别。在TIMIT,DIRHA和CHiME-5上的结果表明,PASE+明显优于PASE和常见的声学特征,具有可转移学习表示。
Abstract
Despite the growing interest in
unsupervised learning
, extracting meaningful knowledge from unlabelled audio remains an open challenge. To take a step in this direction, we recently proposed a problem-agnostic
speech en
→