BriefGPT.xyz
Mar, 2024
点击抓取:通过视觉扩散描述符实现零射击精确操控
Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion Descriptors
HTML
PDF
Nikolaos Tsagkas, Jack Rome, Subramanian Ramamoorthy, Oisin Mac Aodha, Chris Xiaoxuan Lu
TL;DR
利用网络训练的文本到图像扩散生成模型,在无样本情况下对细粒度部件描述符进行准确操作,通过将问题框架化为密集语义部件对应任务,返回用于操作特定部件的夹爪位姿,无需手动示教,验证了该方法在真实世界的桌面场景中的实验,证明了其推进语义感知机器人操作的潜力。
Abstract
precise manipulation
that is
generalizable
across scenes and objects remains a persistent challenge in robotics. Current approaches for this task heavily depend on having a significant number of training instance
→