BriefGPT.xyz
Sep, 2024
重新审视鲁棒自动语音识别的声学特征
Revisiting Acoustic Features for Robust ASR
HTML
PDF
Muhammad A. Shah, Bhiksha Raj
TL;DR
本文解决了自动语音识别(ASR)系统在面对多种现实世界噪声时的鲁棒性不足的问题。研究通过评估生物启发的声学特征,包括新提出的频率掩蔽谱图和伽马音叉谱图,展示了这些特征在提高ASR准确性和鲁棒性方面的潜力,尤其是在对抗攻击下的表现显著优于传统特征。
Abstract
Automatic Speech Recognition
(ASR) systems must be robust to the myriad types of noises present in real-world environments including environmental noise, room impulse response, special effects as well as attacks by malicious actors (
→