BriefGPT.xyz
Oct, 2023
Wavelet Scattering Transform for Improving Generalization in Low-Resourced Spoken Language Identification
Spandan Dey, Premjeet Singh, Goutam Saha
TL;DR
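Improves on the feature extraction methods commonly used in spoken language identification (LID) by using the wavelet scattering transform (WST) to give low-resource LID systems more precise information. The authors optimize the WST features and build ECAPA-TDNN based LID systems with different WST hyperparameters, which substantially improves generalization to unseen data.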
Abstract
Commonly used features in spoken language identification (LID), such as mel-spectrogram or MFCC, lose high-frequency information due to windowing. The loss further increases for longer temporal contexts. To improve generalization of the
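The windowing claim in the abstract can be illustrated with a toy signal. The sketch below is not the paper's WST pipeline; it is a minimal NumPy illustration under one assumed reading of the claim, namely that averaging over a time window smooths out any temporal structure faster than the window length. The sample rate, carrier and modulation frequencies, window sizes, and the helper names `frame_average` and `relative_variation` are all hypothetical choices made for this example.

```python
import numpy as np

def frame_average(x, win):
    """Average |x| over consecutive non-overlapping windows of `win` samples."""
    n = len(x) // win
    return np.abs(x[: n * win]).reshape(n, win).mean(axis=1)

def relative_variation(v):
    """Standard deviation divided by mean: how much the framed feature still varies."""
    return v.std() / v.mean()

sr = 8000                                   # sample rate in Hz (illustrative)
t = np.arange(sr) / sr                      # one second of audio
carrier = np.sin(2 * np.pi * 1000 * t)      # 1 kHz tone
envelope = 1.0 + 0.8 * np.sin(2 * np.pi * 50 * t)  # 50 Hz amplitude modulation
x = envelope * carrier

fine = frame_average(x, 20)     # 2.5 ms windows: shorter than the 20 ms modulation period
coarse = frame_average(x, 800)  # 100 ms windows: span five full modulation periods

# The short window still tracks the 50 Hz modulation; the long window averages it away.
print("2.5 ms windows:", relative_variation(fine))
print("100 ms windows:", relative_variation(coarse))
```

With the long window the 50 Hz modulation disappears from the framed feature almost entirely, while the short window still tracks it. This matches the abstract's remark that the loss grows with longer temporal contexts.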