零样本知识测试的LLM幻觉推理

Nov, 2024

LLM Hallucination Reasoning with Zero-shot Knowledge Test

Seongmin Lee, Hsiang Hsu, Chun-Fu Chen

TL;DR本研究解决了大型语言模型（LLM）在实际应用中产生不准确文本的问题，通过引入幻觉推理任务，将生成文本分类为一致、不一致和虚构三类。我们提出的零样本方法能够评估LLM对给定提示和文本的知识掌握程度，实验结果显示该方法在幻觉推理方面的有效性，强调了其在提升检测性能方面的重要性。

Abstract

LLM hallucination, where LLMs occasionally generate unfaithful text, poses significant challenges for their practical applications. Most existing detection methods rely on external knowledge, LLM fine-tuning, or hallucination-labeled datasets, and they do not distinguish between differ