BriefGPT.xyz
Sep, 2023
逆转诅咒:基于“A是B”训练的LLMs无法学习到“B是A
The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A"
HTML
PDF
Lukas Berglund, Meg Tong, Max Kaufmann, Mikita Balesni, Asa Cooper Stickland...
TL;DR
该研究揭示了自回归大型语言模型(LLM)中的泛化失败现象,即逆转诅咒,导致逻辑推断的基本失败。通过证据和评估表明Reversal Curse在不同模型大小和家族中都是普遍存在的。
Abstract
We expose a surprising failure of
generalization
in auto-regressive large
language models
(LLMs). If a model is trained on a sentence of the form "A is B", it will not automatically generalize to the reverse dire
→