We propose a new approach to video face recognition. Our component-wise
feature aggregation network (C-FAN) accepts a set of face images of a subject
as an input, and outputs a single feature vector as the face representation of
the set for the recognition task. The whole network is tr