Learning Cross-Lingual IR from an English Retriever

Dec, 2021

Yulong Li, Martin Franz, Md Arafat Sultan, Bhavani Iyer, Young-Suk Lee...

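TL;DR: DR.DECR is a new cross-lingual information retrieval (CLIR) system trained with multi-stage knowledge distillation. It learns strong multilingual representations together with a simplified CLIR procedure, and it is more accurate than direct fine-tuning on labeled CLIR data.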

Abstract

We present a new cross-lingual information retrieval (CLIR) model trained using multi-stage knowledge distillation (KD). The teacher and t