BriefGPT.xyz
Nov, 2017
无痕迹:学会重置以实现安全和自主的强化学习
Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning
HTML
PDF
Benjamin Eysenbach, Shixiang Gu, Julian Ibarz, Sergey Levine
TL;DR
本文提出了一种可以同时学习前向策略和清除策略的自动化安全有效的强化学习方法,可以显著减少手动重置,减少不安全的动作,并能自动诱导课程。
Abstract
deep reinforcement learning
algorithms can learn complex behavioral skills, but real-world application of these methods requires a large amount of experience to be collected by the agent. In practical settings, such as
→