BriefGPT.xyz
Jan, 2022
大规模人工语焉不详生成
LARD: Large-scale Artificial Disfluency Generation
HTML
PDF
T. Passali, T. Mavropoulos, G. Tsoumakas, G. Meditskos, S. Vrochidis
TL;DR
本文提出了用于在对话系统中检测语言不流畅的复杂和真实的人工生成方法 LARD,同时发布了一个包含不流畅性的大规模数据集,可以用于四种不同的任务,实验结果表明该方法生成的数据可有效用于检测和移除不流畅性语言。
Abstract
disfluency detection
is a critical task in
real-time dialogue systems
. However, despite its importance, it remains a relatively unexplored field, mainly due to the lack of appropriate
→