基于吉布斯抽样的自动思维链推导重复提示

May, 2023

基于吉布斯抽样的自动思维链推导重复提示

Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling

Weijia Xu, Andrzej Banburski-Fahey, Nebojsa Jojic

TL;DRReprompting通过迭代抽样搜索Chain-of-Thought配方, 使用Gibbs抽样推导出一组在多步推理方面表现良好的CoT配方。它在五个需要多步推理的任务中的性能均优于零样本、少样本和人类编写的CoT基线, 并可以促进从更强的模型向弱模型的知识转移, 提高弱模型的性能。

Abstract

We introduce reprompting, an iterative sampling algorithm that searches for the chain-of-thought (CoT) recipes for a given task without human intervention. Through →