BriefGPT.xyz
May, 2022
复杂系统分类的方法:文字、文本等
Approaches to the classification of complex systems: Words, texts, and more
HTML
PDF
Andrij Rovenchak
TL;DR
通过物理学类比,定义了基于温度、化学势、熵等参数的文本分类,提出在语言学类比的基础上,研究基因组的方法,同时讨论了熵作为文本分类参数的作用和意义。
Abstract
The Chapter starts with introductory information about
quantitative linguistics
notions, like rank--frequency dependence,
zipf's law
, frequency spectra, etc. Similarities in distributions of words in texts with l
→