BriefGPT.xyz
Oct, 2024
深入探讨大型语言模型的机器遗忘
A Closer Look at Machine Unlearning for Large Language Models
HTML
PDF
Xiaojian Yuan, Tianyu Pang, Chao Du, Kejiang Chen, Weiming Zhang...
TL;DR
本研究解决了大型语言模型在涉及敏感或版权内容时的隐私和法律问题,提出了机器遗忘的思路。我们引入了三种新的评估指标,探讨了无目标和有目标的遗忘方法及其问题,并提出了基于最大化熵和答案保留损失的改进策略。实验结果表明,所提方法在多种场景下均有效。
Abstract
Large Language Models
(LLMs) may memorize sensitive or copyrighted content, raising
Privacy
and legal concerns. Due to the high cost of retraining from scratch, researchers attempt to employ
→