A common heuristic in semi-supervised deep learning (SSDL) is to select
unlabelled data based on a notion of semantic similarity to the labelled data.
For example, labelled images of numbers should be paired with unlabelled images
of numbers instead of, say, unlabelled images of cars.