How can a reinforcement learning (RL) agent prepare to solve downstream tasks
if those tasks are not known a priori? One approach is unsupervised skill
discovery, a class of algorithms that learn a set of policies without access to
a reward function. Such algorithms bear a close resemb