BriefGPT.xyz
Dec, 2023
LLM的两面:杰基尔博士与海德先生
Dr. Jekyll and Mr. Hyde: Two Faces of LLMs
HTML
PDF
Matteo Gioele Collu, Tom Janssen-Groesbeek, Stefanos Koffas, Mauro Conti, Stjepan Picek
TL;DR
利用对抗性角色,绕过ChatGPT和Bard聊天机器人的安全机制,使用大型语言模型结合聊天助手应用的技术,模仿提供禁止回答的信息,实现获取未经授权、非法或有害信息的攻击。
Abstract
This year, we witnessed a rise in the use of
large language models
, especially when combined with applications like
chatbot assistants
.
safety me
→