machine learning systems must adapt to data distributions that evolve over
time, in applications ranging from sensor networks and self-driving car
perception modules to brain-machine interfaces. We consider gradual domain
adaptation, where the goal is to adapt an initial classifier tra