Dec, 2023

虚拟世界建模与物理动力学理解

TL;DRCounterfactual World Modeling (CWM) is a pure vision model that uses a temporally-factored masking policy for predicting video data and enables counterfactual queries to extract vision structures, achieving state-of-the-art performance on the Physion benchmark.