BriefGPT.xyz
Jan, 2022
混合语言语料统一多模态标点修复框架
Unified Multimodal Punctuation Restoration Framework for Mixed-Modality Corpus
HTML
PDF
Yaoming Zhu, Liwei Wu, Shanbo Cheng, Mingxuan Wang
TL;DR
本文介绍了一种名为UniPunc的多模态标点恢复框架,使用混合样本并基于共享潜在空间学习混合表示来标点。该模型在真实世界数据集中的表现优于各种强基线模型(例如BERT,MuSe)至少0.8个整体F1得分,成为新的最先进技术。
Abstract
The
punctuation restoration
task aims to correctly punctuate the output transcriptions of
automatic speech recognition
systems. Previous punctuation models, either using text only or demanding the corresponding a
→