BriefGPT.xyz
May, 2025
基于凸评分函数的风险敏感强化学习
Risk-sensitive Reinforcement Learning Based on Convex Scoring Functions
HTML
PDF
Shanyu Han, Yang Liu, Xiang Yu
TL;DR
本文解决了风险目标下强化学习的问题,采用了一类广泛的凸评分函数,该方法涵盖了多种常见风险衡量标准。通过引入辅助变量和增强状态空间,提出了一种定制的演员-评论家算法,为不需要连续马尔可夫决策过程的理论贡献奠定基础,实验证明该算法在统计套利交易中的有效性。
Abstract
We propose a
Reinforcement Learning
(RL) framework under a broad class of risk objectives, characterized by
Convex Scoring Functions
. This class covers many common risk measures, such as variance, Expected Shortf
→