BriefGPT.xyz
Sep, 2023
AutoHall: 大型语言模型的自动幻觉数据集生成
AutoHall: Automated Hallucination Dataset Generation for Large Language Models
HTML
PDF
Zouying Cao, Yifei Yang, Hai Zhao
TL;DR
该论文提出了AutoHall方法,通过自相矛盾的方式自动构建模型特定的幻觉数据集,然后基于这些数据集实现了无资源和黑盒幻觉检测方法,对开源和闭源大型语言模型进行了实验证明,在幻觉检测性能上优于现有基准模型,并且发现了不同模型之间的幻觉比例和类型的差异。
Abstract
While
large language models
(
llms
) have garnered widespread applications across various domains due to their powerful language understanding and generation capabilities, the detection of non-factual or hallucinat
→