Semi-supervised video action recognition tends to enable deep neural networks
to achieve remarkable performance even with very limited labeled data. However,
existing methods are mainly transferred from current image-based methods (e.g.,
FixMatch). Without specifically utilizing the te