Recently, a simple yet effective algorithm -- goal-conditioned
supervised learning (GCSL) -- was proposed to tackle goal-conditioned
reinforcement learning. GCSL is based on the principle of hindsight learning:
by observing states visited in previously executed trajectories and treating
them as attained goals, GCSL learns the corresponding actions via superv