Article code: 1149888
Journal code: 957900
Publication year: 2008
English article: 11 pages, PDF
Full-text version: free download
English title of the ISI article
Distribution of the length of the longest common subsequence of two multi-state biological sequences
Related subjects
Engineering and Basic Sciences, Mathematics, Applied Mathematics
Abstract

The length of the longest common subsequence (LCS) between two biological sequences has been used as a measure of similarity, and the application of this statistic is of importance in genomic studies. Even for the simple case of two sequences of equal length, composed of binary elements with equal state probabilities, the exact distribution of the length of the LCS remains an open question. This problem is also known as an NP-hard problem in computer science. Departing from combinatorial analysis, we use the finite Markov chain imbedding technique to derive the exact distribution of the length of the LCS between two multi-state sequences of different lengths. Numerical results are provided to illustrate the theoretical results.
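
To make the statistic in the abstract concrete, the following is a minimal sketch, not the authors' finite Markov chain imbedding method. It computes the LCS length with the standard dynamic program and then tabulates the exact distribution by brute-force enumeration, assuming independent sequences whose elements are uniform over the states. The function names `lcs_length` and `lcs_length_distribution` are hypothetical, chosen only for illustration, and the enumeration is practical only for very short sequences.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def lcs_length(x, y):
    """Length of the longest common subsequence of x and y (standard DP)."""
    prev = [0] * (len(y) + 1)  # DP row for the previous prefix of x
    for a in x:
        curr = [0]
        for j, b in enumerate(y, start=1):
            if a == b:
                curr.append(prev[j - 1] + 1)  # extend a common subsequence
            else:
                curr.append(max(prev[j], curr[j - 1]))  # drop one element
        prev = curr
    return prev[-1]


def lcs_length_distribution(m, n, states=2):
    """Exact distribution of the LCS length for sequences of lengths m and n.

    Assumption (for illustration only): elements are i.i.d. and uniform
    over `states` symbols, so every pair of sequences is equally likely.
    """
    counts = Counter()
    for x in product(range(states), repeat=m):
        for y in product(range(states), repeat=n):
            counts[lcs_length(x, y)] += 1
    total = states ** (m + n)
    return {k: Fraction(c, total) for k, c in sorted(counts.items())}


if __name__ == "__main__":
    # Two binary sequences of different lengths, as in the abstract's setting.
    for length, prob in lcs_length_distribution(3, 4).items():
        print(length, prob)
```

The enumeration visits `states ** (m + n)` sequence pairs, which is why an approach such as the one described in the abstract is needed for sequences of realistic length.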

Publisher
Database: Elsevier - ScienceDirect
Journal: Journal of Statistical Planning and Inference - Volume 138, Issue 11, 1 November 2008, Pages 3605–3615