کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
5920541 1570826 2011 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Confidence intervals for the substitution number in the nucleotide substitution models
موضوعات مرتبط
علوم زیستی و بیوفناوری علوم کشاورزی و بیولوژیک بوم شناسی، تکامل، رفتار و سامانه شناسی
پیش نمایش صفحه اول مقاله
Confidence intervals for the substitution number in the nucleotide substitution models
چکیده انگلیسی

In the nucleotide substitution model for molecular evolution, a major task in the exploration of an evolutionary process is to estimate the substitution number per site of a protein or DNA sequence. The usual estimators are based on the observation of the difference proportion of the two nucleotide sequences. However, a more objective approach is to report a confidence interval with precision rather than only providing point estimators. The conventional confidence intervals used in the literature for the substitution number are constructed by the normal approximation. The performance and construction of confidence intervals for evolutionary models have not been much investigated in the literature. In this article, the performance of these conventional confidence intervals for one-parameter and two-parameter models are explored. Results show that the coverage probabilities of these intervals are unsatisfactory when the true substitution number is small. Since the substitution number may be small in many situations for an evolutionary process, the conventional confidence interval cannot provide accurate information for these cases. Improved confidence intervals for the one-parameter model with desirable coverage probability are proposed in this article. A numerical calculation shows the substantial improvement of the new confidence intervals over the conventional confidence intervals.

The coverage probabilities of the level 0.95 conventional confidence intervals and the proposed confidence intervals for sequence length L = 100 and 500 for K (substitution numer per site) ⩽ 0.06. They show the unsatisfactory results for the conventional approach and the improved results for the proposed methods.Highlights► We estimate the substitution number per site of a DNA or protein sequence. ► The performance of the conventional confidence intervals is unsatisfactory. ► We propose improved confidence intervals for a substitution model. ► A numerical calculation shows the substantial improvement of the new approach.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Molecular Phylogenetics and Evolution - Volume 60, Issue 3, September 2011, Pages 472-479
نویسندگان
,