BriefGPT.xyz
Nov, 2023
轻量但更忠实:研究剪枝大型语言模型在抽象摘要生成中的幻觉
Lighter, yet More Faithful: Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization
HTML
PDF
George Chrysostomou, Zhixue Zhao, Miles Williams, Nikolaos Aletras
TL;DR
通过对大型语言模型在抽象摘要生成中修剪算法的实证研究,发现修剪后的模型与完整模型相比更少出现幻觉,并提出其与源输入之间的更高词汇重叠可能是幻觉减少的原因。
Abstract
Despite their remarkable performance on
abstractive summarization
,
large language models
(LLMs) face two significant challenges: their considerable size and tendency to hallucinate.
→