BriefGPT.xyz
Apr, 2024
您的精调大型语言模型已是强大的超分布检测器
Your Finetuned Large Language Model is Already a Powerful Out-of-distribution Detector
HTML
PDF
Andi Zhang, Tim Z. Xiao, Weiyang Liu, Robert Bamler, Damon Wischik
TL;DR
通过重新审视预训练大型语言模型和其微调变体之间的似然比作为一种区分所需分布检测的标准,我们展示了似然比可以作为一种有效的OOD检测器,并将其应用于问题回答系统中以改善LLMs在一般问题上的性能。
Abstract
We revisit the
likelihood ratio
between a
pretrained large language model
(LLM) and its finetuned variant as a criterion for out-of-distribution (OOD) detection. The intuition behind such a criterion is that, the
→