BriefGPT.xyz
Jun, 2023
自我监督的语音模型对单词的了解程度如何?
What do self-supervised speech models know about words?
HTML
PDF
Ankita Pasad, Chung-Ming Chien, Shane Settle, Karen Livescu
TL;DR
本研究发现,不同的自监督语音模型可以在不同的层次编码语言特征,在中间层最大程度地捕获了词级的信息,同时在较高层保留了发音等低层次信息,并用在无额外参数的情况下测试了这些模型的层次表现,同时发现使用HuBERT或WavLM的最佳表现层可以实现与更复杂的方法相媲美的词分割和语义句子相似性的表现。
Abstract
Many
self-supervised speech models
(S3Ms) have been introduced over the last few years, producing performance and data efficiency improvements for a variety of speech tasks. Evidence is emerging that different S3Ms encode
→