Feb, 2024

RAG-Driver:多模态大型语言模型中具有上下文学习和检索增强的通用自动驾驶解释

TL;DRRobots powered by 'blackbox' models need to provide human-understandable explanations, and RAG-Driver is a retrieval-augmented multi-modal large language model that achieves state-of-the-art performance in producing driving action explanations, justifications, and control signal prediction, with exceptional zero-shot generalisation capabilities to unseen environments.