Xiaoman Pan, Kai Sun, Dian Yu, Jianshu Chen, Heng Ji...
TL;DR本研究旨在探索使用 Wikipedia 的文本信息和添加更多的训练数据来解决在科学等学科领域中的多项选择题答题任务,实验表明,我们的方法在准确性上相较于先前的最先进技术获得了显著的提升。
Abstract
We focus on multiple-choice question answering (QA) tasks in subject areas
such as science, where we require both broad background knowledge and the facts
from the given subject-area reference corpus. In this wor
Open-domain Question Answering research investigates the generalization performance of a retrieval-augmented QA model, proposing Corpus-Invariant Tuning as an effective training strategy to mitigate knowledge over-memorization and achieve better generalizability.