BriefGPT.xyz
Apr, 2025
SARI:通过课程引导强化学习实现结构化音频推理
SARI: Structured Audio Reasoning via Curriculum-Guided Reinforcement Learning
HTML
PDF
Cheng Wen, Tingwei Guo, Shuaijiang Zhao, Wei Zou, Xiangang Li
TL;DR
本研究解决了音频语言推理中强化学习模型推理能力如何转移的缺口,提出了SARI模型,通过课程引导的强化学习方法进行结构化音频推理。研究发现,该模型显著提高了推理准确率,并且明确的结构化推理和课程学习能有效增强音频语言理解能力。
Abstract
Recent work shows that
Reinforcement Learning
(RL) can markedly sharpen the reasoning ability of large
Language Models
(LLMs) by prompting them to "think before answering." Yet whether and how these gains transfer
→