BriefGPT.xyz
May, 2024
RefChecker:基于引用的细粒度幻觉检查器和大语言模型基准
RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models
HTML
PDF
Xiangkun Hu, Dongyu Ru, Lin Qiu, Qipeng Guo, Tianhang Zhang...
TL;DR
利用Claim-Triplets框架探测大型语言模型中的幻觉,并展示出相较于其他粒度如回复、句子和子句级别的声明,claim-triplets在幻觉检测方面表现出更好的性能。
Abstract
large language models
(LLMs) have shown impressive capabilities but also a concerning tendency to hallucinate. This paper presents
refchecker
, a framework that introduces
→