BriefGPT.xyz
Nov, 2023
A Survey of Language Model Confidence Estimation and Calibration
Jiahui Geng, Fengyu Cai, Yuxia Wang, Heinz Koeppl, Preslav Nakov...
TL;DR
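Assessing the reliability and confidence of language model predictions, and addressing how they relate to AI safety requirements, is an important research area. This paper surveys the methods, techniques, and challenges of confidence estimation and calibration for language models, and proposes directions for future research.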
Abstract
Language models (LMs) have demonstrated remarkable capabilities across a wide range of tasks in various domains. Despite their impressive performance, the reliability of their output is concerning and questionable regarding the demand for