BriefGPT.xyz
Dec, 2022
使用文本扩展的密集检索课程采样
Curriculum Sampling for Dense Retrieval with Document Expansion
HTML
PDF
Xingwei He, Yeyun Gong, A-Long Jin, Hang Zhang, Anlei Dong...
TL;DR
本文提出了一种课程抽样策略来对原始的双编码器的不足进行改进,使得检索模型能够学习将注意力从文档扩展至文档和查询,从而获得高质量的查询知情文档表示,实验结果表明,我们的方法优于以前的密集检索方法。
Abstract
The
dual-encoder
has become the de facto architecture for
dense retrieval
. Typically, it computes the latent representations of the query and document independently, thus failing to fully capture the interactions
→