کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1144989 957444 2010 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Statistical considerations underpinning an alignment-free sequence comparison method
موضوعات مرتبط
مهندسی و علوم پایه ریاضیات آمار و احتمال
پیش نمایش صفحه اول مقاله
Statistical considerations underpinning an alignment-free sequence comparison method
چکیده انگلیسی

The D2D2 statistic is defined as the number of word matches of prespecified length kk, with up to tt mismatches, shared between two given sequences. This statistic finds its application in alignment-free comparisons of biological sequences. It has two main advantages over alignment-based methods for nucleotide and amino-acid sequence comparisons, such as BLAST (basic local alignment search tool). These are (i) D2D2 does not assume that homologous segments are contiguous, and (ii) the algorithm is computationally extremely fast, the runtime being proportional to the size of the sequences in the case of exact matches. This review article summarises results to date on determining the distributional properties of the D2D2 statistic for a range of biologically relevant parameters, describes existing applications of the method, and outlines future research directions.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of the Korean Statistical Society - Volume 39, Issue 3, September 2010, Pages 325–335
نویسندگان
, , , ,