reward design is a critical part of the application of reinforcement
learning, the performance of which strongly depends on how well the reward
signal frames the goal of the designer and how well the signal assesses
progress in reaching that goal. In many cases, the extrinsic rewards p