BriefGPT.xyz
Jun, 2024
RoboGolf: Mastering Real-World Minigolf with a Reflective Multi-Modality Vision-Language Model
Hantao Zhou, Tianying Ji, Jianwei Zhang, Fuchun Sun, Huazhe Xu
TL;DR
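RoboGolf is a framework that effectively tackles challenging minigolf courts by combining perception from dual-camera input, nested VLM-empowered closed-loop control, and a reflective equilibrium loop.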
Abstract
Minigolf, a game with countless court layouts and complex ball motion, constitutes a compelling real-world testbed for the study of embodied intelligence. As it not only challenges spatial and kinodynamic reason…