BriefGPT.xyz
Feb, 2024
Beyond Text: Improving LLM's Decision Making for Robot Navigation via Vocal Cues
Xingpeng Sun, Haoming Meng, Souradip Chakraborty, Amrit Singh Bedi, Aniket Bera
TL;DR
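In human-robot interaction, this study finds that text alone as a conversation modality falls short in certain applications. It improves the decision-making of large language models by incorporating audio transcription together with its associated features, with the aim of better performance in social robot navigation and human-robot interaction.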
Abstract
This work highlights a critical shortcoming in text-based large language models (LLMs) used for human-robot interaction, demonstrating that text alone as a conversation modality falls short in such applications.