角色扮演推理中的偏见与毒性

Sep, 2024

Bias and Toxicity in Role-Play Reasoning

Jinman Zhao, Zifan Qian, Linbo Cao, Yining Wang, Yitian Ding

TL;DR本研究解决了角色扮演在大型语言模型中可能引发的偏见和有害输出的问题。通过系统评估角色扮演对模型在不同基准测试中的影响，研究发现尽管模型的推理能力有所提升，但角色扮演的应用往往增加了生成刻板印象和有害内容的可能性。这一发现对未来的语言模型设计和应用具有重要影响。

Abstract

Role-Play in the Large Language Model (LLM) is a crucial technique that enables models to adopt specific perspectives, enhancing their ability to generate contextually relevant and accurate responses. By simulati