BriefGPT.xyz
Aug, 2016
感知奖励函数
Perceptual Reward Functions
HTML
PDF
Ashley Edwards, Charles Isbell, Atsuo Takanishi
TL;DR
该论文研究了使用感知奖励函数的方法,以提供视觉任务的描述,使代理能够从基于原始像素而不是内部参数的奖励中进行学习。
Abstract
reinforcement learning
problems are often described through rewards that indicate if an
agent
has completed some task. This specification can yield desirable behavior, however many problems are difficult to speci
→