کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6369082 1623804 2016 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Clustering DNA sequences using the out-of-place measure with reduced n-grams
موضوعات مرتبط
علوم زیستی و بیوفناوری علوم کشاورزی و بیولوژیک علوم کشاورزی و بیولوژیک (عمومی)
پیش نمایش صفحه اول مقاله
Clustering DNA sequences using the out-of-place measure with reduced n-grams
چکیده انگلیسی
The alignment-free n-gram based method with the out-of-place measures as the distance has been successfully applied to automatic text or natural languages categorization in real time. However, it is not clear about its performance and the selection of n for comparing genome sequences. Here we propose a symmetric version of the out-of-place measure and a new approach for finding the optimal range of n to construct a phylogenetic tree with the symmetric out-of-place measures. Our method is then applied to real genome sequence datasets. The resulting phylogenetic trees are matching with the standard biological classification. It shows that our proposed method is a very powerful tool for phylogenetic analysis in terms of both classification accuracy and computation efficiency.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Theoretical Biology - Volume 406, 7 October 2016, Pages 61-72
نویسندگان
, ,