BriefGPT.xyz
Jun, 2024
Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph
Roman Vashurin, Ekaterina Fadeeva, Artem Vazhentsev, Akim Tsvigun, Daniil Vasilev...
TL;DR
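A new benchmark is used to evaluate uncertainty quantification and normalization techniques for large language models, aiming to address challenges in text generation tasks such as unsafe and low-quality outputs.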
Abstract
Uncertainty quantification (UQ) is becoming increasingly recognized as a critical component of applications that rely on machine learning (ML). The rapid proliferation of ...