A major stumbling block to progress in understanding basic human
interactions, such as getting out of bed or opening a refrigerator, is the lack of
good training data. Most past efforts have gathered this data explicitly:
starting with a laundry list of action labels, and then querying sea