感知奖励函数

Aug, 2016

Perceptual Reward Functions

Ashley Edwards, Charles Isbell, Atsuo Takanishi

TL;DR该论文研究了使用感知奖励函数的方法，以提供视觉任务的描述，使代理能够从基于原始像素而不是内部参数的奖励中进行学习。

Abstract

reinforcement learning problems are often described through rewards that indicate if an agent has completed some task. This specification can yield desirable behavior, however many problems are difficult to speci