We present a model that uses a single first-person image to generate an egocentric basketball motion sequence in the form of a 12D camera configuration trajectory, which encodes a player's 3d location and 3D head orientation throughout the sequence. To do this, we first introduce a fut