reinforcement learning (RL) algorithms allow artificial agents to improve
their selection of actions to increase rewarding experiences in their
environments. Temporal Difference (TD) Learning -- a model-free RL method -- is
a leading account of the midbrain dopamine system and the basa