BriefGPT.xyz
Oct, 2024
大型语言模型偏好排序不一致性的测量
Measuring the Inconsistency of Large Language Models in Preferential Ranking
HTML
PDF
Xiutian Zhao, Ke Wang, Wei Peng
TL;DR
本研究解决了大型语言模型在提供一致的偏好排序方面的不足,尤其是在缺乏绝对答案的情况下。通过基于序理论的形式化,我们验证了当前先进的语言模型在满足一致性标准方面的能力,结果显示这些模型存在显著的不一致,提示需要进一步的研究以克服这些局限性。
Abstract
Despite
Large Language Models
' (LLMs) recent advancements, their
Bias
and hallucination issues persist, and their ability to offer consistent preferential rankings remains underexplored. This study investigates t
→