In reinforcement learning, an agent learns to reach a set of goals by means
of an external reward signal. In the natural world, intelligent organisms learn
from internal drives, bypassing the need for external signals, which is
beneficial for a wide range of tasks. Motivated by this ob