We present an unsupervised word segmentation model, in which the learning
objective is to maximize the generation probability of a sentence given its all
possible segmentation. Such generation probability can be factorized into the
likelihood of each possible segment given the context