Continually solving new, unsolved tasks is the key to learning diverse
behaviors. Through reinforcement learning (RL), we have made massive strides
towards solving tasks that have a single goal. However, in the multi-task
domain, where an agent needs to reach multiple goals, the choice