BriefGPT.xyz
Jul, 2024
ANAH-v2: 大规模语言模型的分析幻觉注释扩展
ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models
HTML
PDF
Yuzhe Gu, Ziwei Ji, Wenwei Zhang, Chengqi Lyu, Dahua Lin...
TL;DR
该论文介绍了一种迭代自训练框架,可以扩展大型语言模型幻觉注释数据集的规模,提高幻觉注释器的准确性,并且通过先进的零样本推理,在HaluEval和HalluQA上获得了全新的幻觉检测结果。
Abstract
large language models
(LLMs) exhibit hallucinations in long-form question-answering tasks across various domains and wide applications. Current
hallucination detection
and mitigation datasets are limited in domai
→