Owain Evans, Owen Cotton-Barratt, Lukas Finnveden, Adam Bales, Avital Balwit...
TL;DR研究 AI 谎言的伦理学和政治学意义以及建立机构和标准来评估和训练 AI 系统,以在未来降低政治敏感性和道德风险。
Abstract
In many contexts, lying -- the use of verbal falsehoods to deceive -- is
harmful. While lying has traditionally been a human affair, AI systems that
make sophisticated verbal statements are becoming increasingly prevalent. This
raises the question of how we should limit the harm caused by AI "lies" (i.e.
falsehoods that are actively selected for). Human trut