BriefGPT.xyz
Apr, 2024
Towards detecting unanticipated bias in Large Language Models
Anna Kruspe
TL;DR
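This study explores new ways to detect latent bias in large language models, focusing on uncertainty quantification and explainable AI methods. The aim is to make model decisions more transparent so that non-obvious biases can be identified and understood, contributing to the development of fairer and more transparent AI systems.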
Abstract
Over the last year, large language models (LLMs) like ChatGPT have become widely available and have exhibited fairness issues similar to those in previous machine learning systems. Current research is primarily f
→