Isaac Sheidlower, Reuben Aronson, Elaine Schaertl Short
TL;DR基于 Diffusion for Policy Parameters (DPP) 的机器人基础模型设计可以生成独立的、任务特定的策略,从而提高模型的可解释性和可用性。
Abstract
foundation models are a promising path toward general-purpose and user-friendly robots. The prevalent approach involves training a generalist policy that, like a reinforcement learning policy, uses observations t