Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer

Jun, 2022

Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer

Ailin Huang, Zhewei Huang, Shuchang Zhou

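TL;DR: This paper presents the authors' solution to the ACM Multimedia ViCo 2022 Conversational Head Generation Challenge. The solution trains a generalized audio-to-head driver with regularization, assembles a high-quality renderer, tunes the audio-to-behavior model, and post-processes the generated video with a foreground-background fusion module. It placed first in the listening head generation track and second in the talking head generation track.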

Abstract

This paper reports our solution for the ACM Multimedia ViCo 2022 Conversational Head Generation Challenge, which aims to generate vivid face-to-face conversation videos based on audio and reference images. Our solution focuses on training a generalized →