کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
532993 870037 2005 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
The method of NN-grams in large-scale clustering of DNA texts
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
The method of NN-grams in large-scale clustering of DNA texts
چکیده انگلیسی

This paper is devoted to the techniques of clustering of texts based on the comparison of vocabularies of N-grams. In contrast to the regular N-grams approach, the proposed N-grams method is based on calculation of imperfect occurrences of N-grams in a text up to a number of mismatched strings. We demonstrated that such an approach essentially improves the resolving capacity of the N-grams method for DNA texts. Additionally, we discuss a mutual usage scheme of different clustering technique types to verify the partition quality.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition - Volume 38, Issue 11, November 2005, Pages 1902–1912
نویسندگان
, , , , ,