optimal transport ($\OT$) theory provides a useful set of tools to compare probability distributions. As a consequence, the field of $\OT$ is gaining traction and interest within the machine learning community. A few deficiencies usually associated with $\OT$ include its high computati