BriefGPT.xyz
Feb, 2024
BAT:利用大型语言模型学习关于空间声音的推理
BAT: Learning to Reason about Spatial Sounds with Large Language Models
HTML
PDF
Zhisheng Zheng, Puyuan Peng, Ziyang Ma, Xie Chen, Eunsol Choi...
TL;DR
通过结合双耳声音场景分析模型的空间声音知觉能力和大型语言模型的自然语言推理能力,我们提出了BAT,以模拟人类的空间声音推理能力。BAT在各个方面进行了训练,并具有优越的空间声音认知和推理能力,展示了大型语言模型在解读和理解复杂的空间音频环境中的巨大潜力。
Abstract
spatial sound reasoning
is a fundamental human skill, enabling us to navigate and interpret our surroundings based on sound. In this paper we present BAT, which combines the spatial sound perception ability of a
binaura
→