LZ惩罚：一种信息论中的自回归语言模型重复惩罚

Apr, 2025

LZ惩罚：一种信息论中的自回归语言模型重复惩罚

LZ Penalty: An information-theoretic repetition penalty for autoregressive language models

Antonio A. Ginart, Naveen Kodali, Jason Lee, Caiming Xiong, Silvio Savarese...

TL;DR本研究解决了自回归语言模型中重复问题的缺陷，提出了LZ惩罚以降低重复现象而不损失模型能力。该方法基于LZ77无损压缩算法的编码长度，通过预测-压缩对偶性，LZ惩罚能够使开放源码推理模型在无损能力的情况下采用贪婪解码，并显著降低重复率。

Abstract

We introduce the LZ penalty, a penalty specialized for reducing degenerate repetitions in autoregressive language models without loss of capability. The penalty is based on the codelengths in the LZ77 universal lossless compression algorithm. Through the lens of the prediction-compress