February 2022

PQuAD: A Persian Question Answering Dataset

Kasra Darvishi, Newsha Shahbodagh, Zahra Abbasiantaeb, Saeedeh Momtazi

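TL;DR: We introduce a crowdsourced Persian reading comprehension dataset of 80,000 questions and answers, 25% of which are adversarially unanswerable. The dataset is intended to support research on Persian reading comprehension and to provide baseline results.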

Abstract

We present the Persian Question Answering Dataset (PQuAD), a crowdsourced reading comprehension dataset on Persian Wikipedia articles. It includes 80,000 questions along with their answers, with 25% of the questions