BriefGPT.xyz
Apr, 2021
Towards Robust Neural Retrieval Models with Synthetic Pre-Training
Revanth Gangi Reddy, Vikas Yadav, Md Arafat Sultan, Martin Franz, Vittorio Castelli...
TL;DR
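The study shows that machine reading comprehension datasets can be used to train high-performance neural information retrieval systems, and that pre-training on synthetic examples produced by a sequence-to-sequence generator improves both the robustness and the retrieval performance of those systems.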
Abstract
Recent work has shown that commonly available machine reading comprehension (MRC) datasets can be used to train high-performance neural information retrieval (IR) systems. However, the evaluation of neural IR has