Camera captured human pose is an outcome of several sources of variation. Performance of supervised 3d pose estimation approaches comes at the cost of dispensing with variations, such as shape and appearance, that may be useful for solving other related tasks. As a result, the learned