BriefGPT.xyz
Oct, 2020
零样本跨语言问答的合成数据增强
Synthetic Data Augmentation for Zero-Shot Cross-Lingual Question Answering
HTML
PDF
Arij Riabi, Thomas Scialom, Rachel Keraron, Benoît Sagot, Djamé Seddah...
TL;DR
本研究提出了一种方法来改善跨语言问答的表现,利用问答生成模型以跨语言的方式生成合成数据,无需额外标注数据,并展示了在四个多语言数据集上的表现显著优于仅使用英文数据的基线模型,创造了新的最优性能水平。
Abstract
Coupled with the availability of large scale datasets,
deep learning
architectures have enabled rapid progress on the
question answering
task. However, most of those datasets are in English, and the performances
→