BriefGPT.xyz
May, 2023
大型语言模型中的思维过程图推理:超越思维链的有效方法
Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Large Language Models
HTML
PDF
Yao Yao, Zuchao Li, Hai Zhao
TL;DR
使用T5预训练模型实现了图形思维推理模型,提高了基于文本推理任务GSM8K的性能和面向多模态的推理任务ScienceQA的准确率,通过较少的骨干模型参数与基于700M参数的Multimodal-CoT模型获得相当的结果,探索了非线性思考的建模方法。
Abstract
With the widespread use of large language models (LLMs) in
nlp
tasks, researchers have discovered the potential of
chain-of-thought
(CoT) to assist LLMs in accomplishing complex reasoning tasks by generating inte
→