Article ID | Journal | Published Year | Pages | File Type
10328117 | Computational Statistics & Data Analysis | 2010 | 11 Pages | PDF
Abstract
We study the distributions of distances between identical elements of a random sequence (e.g., a sequence of coin tosses or die tosses). We provide methods to generate observations by statistical simulation and show, in particular, that the distributions of multiple distances obey a linear or a geometric (mixture) probability model, respectively. The results are useful for discovering certain structures in texts or other information strings.
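As a rough illustration of the kind of simulation the abstract describes, the following Python sketch generates a sequence of fair die tosses and records the distance between each element and its previous identical occurrence. It is not the authors' method: the function names (`gap_distances`, `simulate_die_gaps`, `empirical_pmf`, `geometric_pmf`) are hypothetical, and the sketch covers only the simplest case, the distance between consecutive occurrences of the same symbol. For independent, uniformly distributed symbols with k possible values, that distance follows a geometric distribution with success probability 1/k, which the sketch compares against the simulated frequencies. The "multiple distances" and mixture models mentioned in the abstract are not reproduced here.

```python
import random
from collections import Counter


def gap_distances(seq):
    """Distances between consecutive occurrences of the same element."""
    last_seen = {}  # element -> index of its most recent occurrence
    gaps = []
    for i, x in enumerate(seq):
        if x in last_seen:
            gaps.append(i - last_seen[x])
        last_seen[x] = i
    return gaps


def simulate_die_gaps(n_tosses=100_000, sides=6, seed=0):
    """Simulate fair die tosses (an assumption) and return all gap distances."""
    rng = random.Random(seed)
    seq = [rng.randrange(sides) for _ in range(n_tosses)]
    return gap_distances(seq)


def empirical_pmf(gaps):
    """Relative frequency of each observed distance."""
    counts = Counter(gaps)
    total = len(gaps)
    return {d: c / total for d, c in sorted(counts.items())}


def geometric_pmf(d, p):
    """P(distance = d) for a geometric model on d = 1, 2, ..."""
    return (1 - p) ** (d - 1) * p


if __name__ == "__main__":
    pmf = empirical_pmf(simulate_die_gaps())
    # Compare simulated frequencies with the geometric model, p = 1/6.
    for d in range(1, 6):
        print(d, round(pmf.get(d, 0.0), 4), round(geometric_pmf(d, 1 / 6), 4))
```

The same `gap_distances` helper accepts any sequence of hashable items, so it can be applied to the characters or words of a text and the resulting frequencies compared against the geometric baseline expected for a purely random sequence.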
Related Topics
Physical Sciences and Engineering > Computer Science > Computational Theory and Mathematics