Feb, 2024
RAG-Driver:多模态大型语言模型中具有上下文学习和检索增强的通用自动驾驶解释
RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model
Jianhao Yuan, Shuyang Sun, Daniel Omeiza, Bo Zhao, Paul Newman...
TL;DRRobots powered by 'blackbox' models need to provide human-understandable explanations, and RAG-Driver is a retrieval-augmented multi-modal large language model that achieves state-of-the-art performance in producing driving action explanations, justifications, and control signal prediction, with exceptional zero-shot generalisation capabilities to unseen environments.