BriefGPT.xyz
Mar, 2025
Exploiting Instruction-Following Retrievers for Malicious Information Retrieval
Parishad BehnamGhader, Nicholas Meade, Siva Reddy
TL;DR
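This study examines the safety risks that arise when instruction-following retrievers are used to satisfy malicious queries, addressing a gap in existing research. In an empirical study of six leading retrievers, we find that they can select relevant harmful information in response to malicious requests, and that this risk is closely tied to the retrievers' instruction-following ability. This suggests that the risk of malicious misuse grows as retrieval capability improves.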
Abstract
Instruction-following retrievers have been widely adopted alongside LLMs in real-world applications, but little work has investigated the safety risks surrounding their increasing search capabilities. We empirica