Article ID Journal Published Year Pages File Type
5102795 Physica A: Statistical Mechanics and its Applications 2017 9 Pages PDF
Abstract
Treating a text, after the removal of paragraphs and punctuations, as a spectrum of blanks, the distributions of the length of words of ten languages are analyzed. Using models from the statistical theory of spectra, it is found that the ten languages can be classified into two families: one with words that follow a Wigner-like distribution while the words of the other obey a Poisson-like distribution.
Related Topics
Physical Sciences and Engineering Mathematics Mathematical Physics
Authors
, ,