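TL;DR: This paper proposes SelfCP, which uses Large Language Models (LLMs) themselves to compress long prompts into compact virtual tokens. It supports both unconditional and conditional prompt compression, making it suitable for standard tasks as well as tasks with specific objectives. The results show that the compressed virtual tokens can effectively replace the original prompts.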
Abstract
Long prompts lead to huge hardware costs when using large language models (LLMs). Unfortunately, many tasks, such as summarization, inevitably introduce long task inputs, and the wide application of in-context le