June 2024

How to Compute the Probability of a Word

Tiago Pimentel, Clara Meister

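TL;DR: How to correctly compute the probability of a word, and what this implies for analyses of sentence comprehension and lexical optimization.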

Abstract

Language models (LMs) estimate the probability distribution over sequences of natural language; these distributions are crucial for computing perplexity and surprisal in linguistics research. While we are usually concerned with measuring these values for words, most LMs operate over