Learning to control robots without requiring engineered models has been a
long-term goal, promising diverse and novel applications. Yet, reinforcement
learning has only achieved limited impact on real-time robot control due to its
high demand of real-world interactions. In this work, b