BriefGPT.xyz
Dec, 2024
黑箱大语言模型的校准过程调查
A Survey of Calibration Process for Black-Box LLMs
HTML
PDF
Liangru Xie, Hui Liu, Jingying Zeng, Xianfeng Tang, Yan Han...
TL;DR
本研究针对黑箱大语言模型(LLMs)在输出可靠性评估上的挑战,提供了首个全面的校准技术调查。通过定义校准过程的关键步骤并系统性回顾相关方法,本文不仅揭示了实现这些步骤的独特挑战,还探讨了黑箱LLMs校准过程的应用及未来研究方向,从而为提升可靠性和人机协作提供新视角。
Abstract
Large Language Models (LLMs) demonstrate remarkable performance in semantic understanding and generation, yet accurately assessing their output
Reliability
remains a significant challenge. While numerous studies have explored
→