Human perceptual studies are the gold standard for the evaluation of many
research tasks in machine learning, linguistics, and psychology. However, these
studies require significant time and cost to perform. As a result, many
researchers use objective measures that can correlate poorly