BriefGPT.xyz
Nov, 2023
恶魔天才:深入探究基于LLM的智能体的安全性
Evil Geniuses: Delving into the Safety of LLM-based Agents
HTML
PDF
Yu Tian, Xiao Yang, Jingyuan Zhang, Yinpeng Dong, Hang Su
TL;DR
通过对大型语言模型(LLMs)进行安全评估,揭示了LLM-based agents面临的挑战、安全漏洞以及对未来研究的启示。
Abstract
The rapid advancements in
large language models
(LLMs) have led to a resurgence in
llm-based agents
, which demonstrate impressive human-like behaviors and cooperative capabilities in various interactions and stra
→