BriefGPT.xyz
Apr, 2020
从推断到生成:端到端完全自监督的人脸语音生成
From Inference to Generation: End-to-end Fully Self-supervised Generation of Human Face from Speech
HTML
PDF
Hyeong-Seok Choi, Changdae Park, Kyogu Lee
TL;DR
本研究提出了一种多模态学习框架,利用近期发展的GAN技术,从声音波形中直接生成自然人脸图像分布,同时分析网络是否能够自然地分离生成人脸图像的两个潜在因素,并探索网络是否能够通过建模这些因素来生成自然的人脸图像分布。
Abstract
This work seeks the possibility of generating the human face from voice solely based on the
audio-visual data
without any human-labeled annotations. To this end, we propose a
multi-modal learning
framework that l
→