Xiaoxue Gao, Zexin Li, Yiming Chen, Cong Liu, Haizhou Li
TL;DR本研究解决了在现实应用中,自动语音识别(ASR)模型对输入扰动的鲁棒性不足的问题。我们提出了一种创新的方法,通过时间域的可转移攻击和语音感知梯度优化(SAGO),有效地增强了黑箱 ASR 模型的抗攻击能力。实验结果表明,在两个数据库的五个模型上,我们的方法显著优于基线方法。
Abstract
Given the extensive research and real-world applications of Automatic Speech Recognition (ASR), ensuring the Robustness of ASR models against minor input perturbations becomes a crucial consideration for maintain