学习提出有信息量的问题：利用偏好优化和期望信息增益增强LLM

Jun, 2024

学习提出有信息量的问题：利用偏好优化和期望信息增益增强LLM

Learning to Ask Informative Questions: Enhancing LLMs with Preference Optimization and Expected Information Gain

Davide Mazzaccara, Alberto Testoni, Raffaella Bernardi

TL;DR该论文介绍了一种增强大型语言模型生成问题信息性的方法，通过在20问游戏对话中应用直接偏好优化算法，从同一模型中抽样多个问题，创建低信息增益和高信息增益问题的配对来提高问题的效果，并在不同领域展示了该方法的有效性。

Abstract

questions are essential tools for acquiring the necessary information to complete information-seeking tasks. However, large language models (LLMs), especially open-source models, often perform poorly in generatin