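TL;DR: This paper studies the performance of large language models (LLMs) as decision-makers in the dueling bandits (DB) setting and introduces an augmented algorithm, IF-Enhanced LLM, which combines the in-context decision-making capabilities of LLMs with the theoretical guarantees of classic DB algorithms to improve the trustworthiness and performance robustness of LLMs on decision-making tasks.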
Abstract
In-context decision-making is an important capability of artificial general intelligence, which large language models (LLMs) have effectively demonstrated in various scenarios. However, LLMs often face challenges