BriefGPT.xyz
Jul, 2024
BEVWorld:基于统一BEV潜空间的自动驾驶多模态世界模型
BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space
HTML
PDF
Yumeng Zhang, Shi Gong, Kaixin Xiong, Xiaoqing Ye, Xiao Tan...
TL;DR
通过对多模态传感器输入进行统一而紧凑的鸟瞰视图(BEV)潜在空间编码,BEVWorld 提供了一种新的方法来预测潜在的未来场景,证明了其在自动驾驶任务中生成未来场景的有效性,并对感知和运动预测等下游任务提供了帮助。
Abstract
world models
are receiving increasing attention in
autonomous driving
for their ability to predict potential future scenarios. In this paper, we present
→