BriefGPT.xyz
Nov, 2024
基于图的自信度校准方法用于大型语言模型
Graph-based Confidence Calibration for Large Language Models
HTML
PDF
Yukun Li, Sijia Wang, Lifu Huang, Li-Ping Liu
TL;DR
本文解决了大型语言模型(LLMs)在提供准确自信度估计时的可靠性问题。研究提出了一种新颖的方法,通过结合LLM的自一致性与标记数据,训练一个辅助模型来预测其回答的正确性,最终实验表明该方法在多个基准数据集上明显优于现有自信度校准方法,并显著提升了模型在域外数据上的泛化能力。
Abstract
One important approach to improving the reliability of
Large Language Models
(LLMs) is to provide accurate confidence estimations regarding the correctness of their answers. However, developing a well-calibrated
Confide
→