BriefGPT.xyz
Nov, 2023
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems
Jon Saad-Falcon, Omar Khattab, Christopher Potts, Matei Zaharia
TL;DR
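Using synthetic training data, ARES fine-tunes lightweight language models to judge the quality of individual RAG components, and it accurately evaluates the effectiveness of RAG systems across multiple tasks from KILT and SuperGLUE.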
Abstract
Evaluating retrieval-augmented generation (RAG) systems traditionally relies on hand annotations for input queries, passages to retrieve, and responses to generate. We introduce ARES, an Automated RAG Evaluation …