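TL;DR: Proposes Learning by Watching (LbW), a framework that increases the amount and novelty of data available to a driving policy by indirectly observing the demonstrations of surrounding vehicles. This yields more robust driving and fast adaptation to new scenarios, reaching an 82% success rate with only 10 minutes of data.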
Abstract
When in a new situation or geographical location, human drivers have an extraordinary ability to watch others and learn maneuvers that they themselves may have never performed. In contrast, existing techniques for learning to drive preclude such a possibility as they assume direct access to an instrumented ego-vehicle with fully known observations and expert