Dec, 2022
Learning to Dub Movies via Hierarchical Prosody Models
Gaoxiang Cong, Liang Li, Yuankai Qi, Zhengjun Zha, Qi Wu...
TL;DR
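This study proposes a new movie dubbing architecture based on hierarchical prosody modeling. It links visual information to the corresponding speech prosody at three levels: lip movement, facial expression, and scene. The architecture includes an emotion booster that captures the atmosphere of the scene, and the authors report good experimental results.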
Abstract
Given a piece of text, a video clip and a reference audio, the movie dubbing (also known as visual voice clone, V2C) task aims to generate speeches that match the speaker's emotion presented in the video using the