BriefGPT.xyz
Jul, 2023
OUTFOX: LLM-generated Essay Detection through In-context Learning with Adversarially Generated Examples
Ryuto Koike, Masahiro Kaneko, Naoaki Okazaki
TL;DR
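Proposes OUTFOX, a framework that improves the robustness of LLM-generated-text detectors by allowing the detector and the attacker to consider each other's outputs, and applies it to the domain of student essays.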
Abstract
Large language models (LLMs) have achieved human-level fluency in text generation, making it difficult to distinguish between human-written and LLM-generated texts. This poses a growing risk of misuse of LLMs and …