BriefGPT.xyz
Feb, 2024
用样本一致性校准大型语言模型
Calibrating Large Language Models with Sample Consistency
HTML
PDF
Qing Lyu, Kumar Shridhar, Chaitanya Malaviya, Li Zhang, Yanai Elazar...
TL;DR
通过从多个随机抽样的模型生成的分布中导出确定度來提高大型语言模型(LLM)预测的准确度。在多个开放和闭源模型上进行广泛评估,结果表明基于一致性的校准方法优于现有的事后方法,并提供了选择适用于不同LLMs特性的合适一致性度量标准的实用指南。
Abstract
Accurately gauging the
confidence level
of
large language models
' (LLMs) predictions is pivotal for their reliable application. However, LLMs are often uncalibrated inherently and elude conventional
→