通过多提示改进最小贝叶斯风险解码

Jul, 2024

通过多提示改进最小贝叶斯风险解码

Improving Minimum Bayes Risk Decoding with Multi-Prompt

David Heineman, Yao Dou, Wei Xu

TL;DR该研究解决了指令微调的大型语言模型对提示构建敏感性导致性能不稳定的问题。提出了多提示解码的新方法，通过来自提示库的多个候选生成进行解码，并使用最小贝叶斯风险解码进行集成。研究结果表明，多提示能够提高生成任务的表现，显著优化候选空间的多样性和质量。

Abstract

While instruction fine-tuned LLMs are effective text generators, sensitivity to prompt construction makes performance unstable and sub-optimal in practice. Relying on a single "best" prompt cannot capture all differing approaches to a generation problem. Using this observation, we propose Mul