Elisa Bassignana, Max Müller-Eberstein, Mike Zhang, Barbara Plank
TL;DR本文提出了一种量化方法 - 最大证据对数(Logarithm of Maximum Evidence), 以预测在目标任务上表现最好的语言模型,通过与人工排名进行比较,本文发现来自定量指标的证据更加稳健,并且可以帮助识别出意想不到的最佳语言模型候选项。
Abstract
With the increase in availability of large pre-trained language models (LMs) in Natural Language Processing (NLP), it becomes critical to assess their fit for a specific target task a priori - as fine-tuning the entire space of available LMs is computationally prohibitive and unsustain