Understanding videos such as TV series and movies requires analyzing who the
characters are and what they are doing. We address the challenging problem of
clustering face tracks based on their identity. Different from previous work in
this area, we choose to operate in a realistic and