BriefGPT.xyz
Aug, 2023
剧本音视频的讲话人分离
Speaker Diarization of Scripted Audiovisual Content
HTML
PDF
Yogesh Virkar, Brian Thompson, Rohit Paturi, Sundararajan Srinivasan, Marcello Federico
TL;DR
利用制作脚本为演讲人辨别任务提取伪标记数据的半监督方法在66个节目测试集上相对于两个非监督基准模型显示出了51.7%的改进。
Abstract
The
media localization industry
usually requires a
verbatim script
of the final film or TV production in order to create subtitles or dubbing scripts in a foreign language. In particular, the
→