Segmentation has traditionally been formulated as a complete-label pixel
classification task: for each pixel, predict one class from a fixed set of
predefined semantic categories shared by all images or videos. Yet,
following this formulation, standard architectures will ine