BriefGPT.xyz
Jul, 2024
大规模语言模型的实用取消学习
Practical Unlearning for Large Language Models
HTML
PDF
Chongyang Gao, Lixu Wang, Chenkai Weng, Xiao Wang, Qi Zhu
TL;DR
LLM中各种领域和任务展现出了令人印象深刻的性能,但其安全问题日益严重。我们提出了O3框架,通过包含离散分布检测器和正交低秩适配器,解决连续的反学习请求,同时在保持效用的同时确保最佳的反学习效果。
Abstract
While
llms
have demonstrated impressive performance across various domains and tasks, their
security issues
have become increasingly severe.
mach
→