Existing methods for pixel-wise labelling tasks generally disregard the underlying structure of labellings and so the resulting predictions may be visually implausible. While incorporating structure into the model should improve prediction quality, doing so is challenging - manually specifying the form of structural constraints may be impractical and inferen