BriefGPT.xyz
Sep, 2022
The Ability of Self-Supervised Speech Models for Audio Representations
Tung-Yu Wu, Chen-An Li, Tzu-Han Lin, Tsu-Yuan Hsu, Hung-Yi Lee
TL;DR
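This study proposes an ensemble framework that fuses embeddings from self-supervised learning (SSL) speech models in order to examine their representation ability on audio and non-speech tasks. Experiments show that the framework generally outperforms current state-of-the-art SSL speech and audio models, and that it remains strong even on fine-grained music tasks.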
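As a rough illustration of what fusing embeddings from several models can look like, the sketch below mean-pools each model's frame-level output over time and concatenates the results into one clip-level vector. This is an assumption made for illustration only: the summary does not say how the paper's framework combines embeddings, and the function names (`mean_pool`, `fuse_embeddings`) and the toy inputs are hypothetical.

```python
from statistics import fmean


def mean_pool(frames):
    """Average a sequence of frame-level vectors (T x D) into one D-dim vector."""
    if not frames:
        raise ValueError("need at least one frame")
    # zip(*frames) groups the values of each dimension across all frames.
    return [fmean(dim) for dim in zip(*frames)]


def fuse_embeddings(per_model_frames):
    """Concatenate the time-pooled embeddings of several models.

    Assumption: concatenation is used here as one simple fusion choice;
    it is not claimed to be the method used in the paper.
    """
    fused = []
    for frames in per_model_frames:
        fused.extend(mean_pool(frames))
    return fused


# Toy stand-ins for the outputs of two hypothetical SSL models on one clip.
model_a = [[1.0, 3.0], [3.0, 5.0]]            # 2 frames, 2 dimensions
model_b = [[0.0, 0.0, 6.0], [2.0, 4.0, 6.0]]  # 2 frames, 3 dimensions

fused = fuse_embeddings([model_a, model_b])
print(fused)  # [2.0, 4.0, 1.0, 2.0, 6.0]
```

The fused vector has as many dimensions as the models' embeddings combined (here 2 + 3 = 5), so a downstream classifier sees every model's representation side by side.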
Abstract
Self-supervised learning (SSL) speech models have achieved unprecedented success in speech representation learning, but some questions regarding their representation ability remain unanswered. This paper addresse