BriefGPT.xyz
Apr, 2024
Evaluating Retrieval Quality in Retrieval-Augmented Generation
Alireza Salemi, Hamed Zamani
TL;DR
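Evaluating retrieval-augmented generation (RAG) is challenging, and traditional end-to-end evaluation methods are computationally expensive. We propose a new evaluation approach, eRAG, which uses each document in the retrieval list and evaluates the generated output against the ground-truth labels of the downstream task. Experiments show that eRAG correlates well with downstream RAG performance and offers significant computational advantages.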
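The idea summarized above can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: it assumes each retrieved document is passed to the generator on its own, that the per-document output is scored against the ground truth with a downstream metric, and that the resulting scores are then aggregated with a ranking metric (precision at k is chosen here only as an example). The names `erag_document_scores`, `precision_at_k`, `toy_generate`, and `exact_match` are hypothetical, and `toy_generate` is a stand-in for a real LLM call.

```python
from typing import Callable, List, Sequence


def erag_document_scores(
    query: str,
    documents: Sequence[str],
    generate: Callable[[str, str], str],
    task_metric: Callable[[str, str], float],
    ground_truth: str,
) -> List[float]:
    """Score each retrieved document by the quality of the output it yields.

    Assumption: the generator sees one document at a time, and the
    downstream metric compares its output with the ground-truth label.
    """
    scores = []
    for doc in documents:
        output = generate(query, doc)  # one generation per document
        scores.append(task_metric(output, ground_truth))
    return scores


def precision_at_k(scores: Sequence[float], k: int, threshold: float = 0.5) -> float:
    """Fraction of the top-k documents whose score reaches the threshold.

    Assumption: precision is used purely as an example aggregation.
    """
    top = list(scores[:k])
    if not top:
        return 0.0
    return sum(1 for s in top if s >= threshold) / len(top)


def toy_generate(query: str, document: str) -> str:
    """Hypothetical stand-in for an LLM: answers with the document's last word."""
    return document.split()[-1]


def exact_match(output: str, ground_truth: str) -> float:
    """Downstream metric: 1.0 on a case-insensitive exact match, else 0.0."""
    return 1.0 if output.strip().lower() == ground_truth.strip().lower() else 0.0


docs = [
    "The capital of France is Paris",
    "Berlin is in Germany",
    "France's largest city is Paris",
]
scores = erag_document_scores(
    "What is the capital of France?", docs, toy_generate, exact_match, "Paris"
)
print(scores)                       # per-document relevance scores
print(precision_at_k(scores, k=3))  # list-level retrieval score
```

In this sketch the cost is one generation per retrieved document, with each prompt containing a single document rather than the full list; whether that matches the cost profile reported in the paper depends on details not given in this summary.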
Abstract
Evaluating retrieval-augmented generation (RAG) presents challenges, particularly for retrieval models within these systems. Traditional end-to-end evaluation methods are computationally expensive. Furthermore, …