Multi-agent planning under stochastic dynamics is usually formalised using decentralized (partially observable) Markov Decision Processes (MDPs) and reachability or expected reward specifications. In this paper, we propose a different approach: we use an MDP describing how a single ag