Condenser: 用于密集检索的预训练模型架构

Apr, 2021

Condenser: 用于密集检索的预训练模型架构

Is Your Language Model Ready for Dense Representation Fine-tuning?

Luyu Gao, Jamie Callan

TL;DR该论文提出了一种基于 Condenser 的 Transformer 架构，可以提高标准 LM 在文本检索和相似性任务上的效果。

Abstract

Pre-trained language models (LM) have become go-to text representation encoders. Prior research used deep LMs to encode text sequences such as sentences and passages into single dense vector representations. These dense representations have been used in efficient text comparison and em