BriefGPT.xyz
Jan, 2023
使用预训练的Diffusion模型改善源分离
Separate And Diffuse: Using a Pretrained Diffusion Model for Improving Source Separation
HTML
PDF
Shahar Lutati, Eliya Nachmani, Lior Wolf
TL;DR
本文研究了语音分离问题,通过将分离模型和扩散模型的输出线性相结合,并利用学习到的权重来实现在多说话人的同时达到前所未有的语音分离效果,从而推翻了先前成立的基于人类语音确定性模型的上界限制。
Abstract
The problem of
speech separation
, also known as the cocktail party problem, refers to the task of isolating a single speech signal from a mixture of speech signals. Previous work on
source separation
derived an u
→