BriefGPT.xyz
May, 2024
AI风险管理应同时考虑安全与保障
AI Risk Management Should Incorporate Both Safety and Security
HTML
PDF
Xiangyu Qi, Yangsibo Huang, Yi Zeng, Edoardo Debenedetti, Jonas Geiping...
TL;DR
介绍了AI安全和AI安全漏洞之间的相互作用,讨论了定义上的不一致和缺乏共识,并引入一个统一的参考框架来澄清AI安全和AI安全之间的差异和相互作用,旨在促进不同社区之间的共识和有效合作。
Abstract
The exposure of
security
vulnerabilities
in
safety
-aligned language models, e.g., susceptibility to
→