Article code: 4500769; Journal code: 1320021; Publication year: 2008; English article: 6 pages (PDF)
English title of the ISI article
WSE, a new sequence distance measure based on word frequencies
Related subjects
Life Sciences and Biotechnology > Agricultural and Biological Sciences > Agricultural and Biological Sciences (General)
Abstract

In this article, we present a new distance metric, the Weighted Sequence Entropy (WSE), based on the short-word composition of biological sequences. As a revision of the classical relative entropy (RE), our metric (1) behaves equivalently to RE for small k, and (2) avoids the degeneracy that arises when some word types are absent in one sequence but present in the other. Experiments on 25 viruses, including SARS-CoVs, show that our method and RE give exactly the same phylogenetic tree when the word length k ≤ 3. When k > 3, our method still works and yields a convergent phylogenetic topology, whereas RE gives degenerate results.
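
The degeneracy mentioned in the abstract can be illustrated with a minimal sketch of the classical relative entropy over k-letter word frequencies. This is only an illustration under stated assumptions: the function names, the use of overlapping words, the natural logarithm, and the two toy sequences are not taken from the paper, and the WSE formula itself is not given in the abstract, so it is not implemented here.

```python
import math
from collections import Counter


def word_frequencies(seq, k):
    """Relative frequencies of overlapping k-letter words in seq (assumed counting scheme)."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def relative_entropy(p, q):
    """Classical relative entropy: sum over words of p(w) * log(p(w) / q(w)).

    Returns math.inf when a word present in p is absent from q,
    which is the degenerate case the abstract refers to.
    """
    total = 0.0
    for word, pw in p.items():
        qw = q.get(word, 0.0)
        if qw == 0.0:
            return math.inf
        total += pw * math.log(pw / qw)
    return total


# Toy sequences made up for illustration; they are not data from the paper.
seq_a = "ACGTACGTTGCA"
seq_b = "ACGGACGTTGCAAC"

for k in (1, 4):
    d = relative_entropy(word_frequencies(seq_a, k), word_frequencies(seq_b, k))
    print(f"k={k}: RE = {d}")
```

With these toy inputs, every single letter occurs in both sequences, so RE is finite for k = 1, while for k = 4 at least one word of the first sequence never occurs in the second and RE becomes infinite. Longer words make such absences more likely, which is consistent with the abstract's observation that RE degenerates for larger k.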

Publisher
Database: Elsevier - ScienceDirect
Journal: Mathematical Biosciences - Volume 215, Issue 1, September 2008, Pages 78–83