BriefGPT.xyz
Jul, 2024
VoxAct-B: 基于体素的双手操作与稳定策略
VoxAct-B: Voxel-Based Acting and Stabilizing Policy for Bimanual Manipulation
HTML
PDF
I-Chun Arthur Liu, Sicheng He, Daniel Seita, Gaurav Sukhatme
TL;DR
VoxAct-B是一种基于语言驱动的基于体素的方法,通过利用视觉语言模型(VLMs)优先考虑场景中的关键区域并重建一个体素网格,在仿真和真实世界的实验中,VoxAct-B在精细双臂操纵任务上表现优异,实现了更高效的策略学习。
Abstract
bimanual manipulation
is critical to many robotics applications. In contrast to single-arm manipulation,
bimanual manipulation
tasks are challenging due to higher-dimensional action spaces. Prior works leverage l
→