语言模型置信度评估与校准调查

Nov, 2023

A Survey of Language Model Confidence Estimation and Calibration

Jiahui Geng, Fengyu Cai, Yuxia Wang, Heinz Koeppl, Preslav Nakov...

TL;DR评估语言模型预测的可靠性和置信度以及解决其与AI安全需求的关系是一项重要研究领域，本文综述了语言模型置信度估计和校准的方法、技术和挑战，并提出了未来研究的方向。

Abstract

language models (LMs) have demonstrated remarkable capabilities across a wide range of tasks in various domains. Despite their impressive performance, the reliability of their output is concerning and questionable regarding the demand for →