Many AI problems, in robotics and other domains, are goal-based, essentially seeking trajectories leading to various goal states. reinforcement learning (RL), building on Bellman's optimality equation, naturally optimizes for a single goal, yet can be made multi-goal by augmenting the