This paper presents a decentralized leader-follower multi-robot formation
control method based on a reinforcement learning (RL) algorithm, applied to a swarm of
small educational Sphero robots. Since the basic Q-learning method is known to
require large memory resources for Q-tables, this work