کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
978955 933312 2010 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Entropy analysis of natural language written texts
موضوعات مرتبط
مهندسی و علوم پایه ریاضیات فیزیک ریاضی
پیش نمایش صفحه اول مقاله
Entropy analysis of natural language written texts
چکیده انگلیسی

The aim of the present work is to investigate the relative contribution of ordered and stochastic components in natural written texts and examine the influence of text category and language on these. To this end, a binary representation of written texts and the generated symbolic sequences are examined by the standard block entropy analysis and the Shannon and Kolmogorov entropies are obtained. It is found that both entropies are sensitive to both language and text category with the text category sensitivity to follow almost the same trends in both languages (English and Greek) considered. The values of these entropies are compared with those of stochastically generated symbolic sequences and the nature of correlations present in this representation of real written texts is identified.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Physica A: Statistical Mechanics and its Applications - Volume 389, Issue 16, 15 August 2010, Pages 3260–3266
نویسندگان
, , , , ,