BriefGPT.xyz
Apr, 2025
大规模视觉语言模型推理的快速-缓慢思维
Fast-Slow Thinking for Large Vision-Language Model Reasoning
HTML
PDF
Wenyi Xiao, Leilei Gan, Weilong Dai, Wanggui He, Ziwei Huang...
TL;DR
本研究解决了大规模视觉语言模型中的“过度思考”现象,提出了一种名为FAST的快速-缓慢思维框架,该框架根据问题特征动态调整推理深度。实验结果表明,FAST在七个推理基准上的准确性达到最新水平,相较于基础模型实现了超过10%的相对提升,并将标记使用量减少了32.7%至67.3%。
Abstract
Recent advances in
Large Vision-Language Models
(LVLMs) have revealed an \textit{overthinking} phenomenon, where models generate verbose
Reasoning
across all tasks regardless of questions. To address this issue,
→