Recent advances in deep reinforcement learning have demonstrated the
capability of learning complex control policies from many types of
environments. When learning policies for safety-critical applications, it is
essential to be sensitive to risks and avoid catastrophic events. Towards