BriefGPT.xyz
Feb, 2022
JaQuAD: 用于机器阅读理解的日语问答数据集
JaQuAD: Japanese Question Answering Dataset for Machine Reading Comprehension
HTML
PDF
ByungHoon So, Kyuhong Byun, Kyungwon Kang, Seongjin Cho
TL;DR
本文提出了JaQuAD数据集,它是一种由人类注释的日语问答数据集,用于非英语语言的QA任务的研究。该数据集由39,696个问题-答案对组成并且基于日本维基百科文章。我们针对基线模型进行微调,测试数据集上的F1得分为78.92%,EM为63.38%。
Abstract
question answering
(QA) is a task in which a machine understands a given document and a question to find an answer. Despite impressive progress in the
nlp
area, QA is still a challenging problem, especially for n
→