BriefGPT.xyz
Sep, 2022
存在干扰下的强化学习安全探索方法
Safe Exploration Method for Reinforcement Learning under Existence of Disturbance
HTML
PDF
Yoshihiro Okawa, Tomotake Sasaki, Hitoshi Yanami, Toru Namerikawa
TL;DR
提出了一种安全探索方法,该方法利用受控对象和干扰的部分先前知识,以确保满足特定的状态约束条件,即使受控对象暴露于遵循正常分布的随机干扰下。
Abstract
Recent rapid developments in
reinforcement learning
algorithms have been giving us novel possibilities in many fields. However, due to their exploring property, we have to take the risk into consideration when we apply those algorithms to
→