BriefGPT.xyz
Feb, 2022
基于人工智能副驾驶优化的安全驾驶策略高效学习
Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization
HTML
PDF
Quanyi Li, Zhenghao Peng, Bolei Zhou
TL;DR
本文介绍了一种新的基于人工智能协作的优化学习方法,即HACO,它能够在保证训练安全的同时,并非常高效地利用少量的人类干预来训练出一个性能很高、泛化性很好、且适用于各种交通情景的自主驾驶代理。
Abstract
human intervention
is an effective way to inject human knowledge into the training loop of
reinforcement learning
, which can bring fast learning and ensured
→