کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
5918827 | 1570811 | 2015 | 9 صفحه PDF | دانلود رایگان |
- The possible effects of missing data on divergence dating with BEAST are unclear.
- We analyzed impacts of missing data on BEAST with empirical data from reptiles (20 genes, 32 taxa).
- Our analyses showed generally little impact of missing data on dating with BEAST, with errors typically less than 5Â Myr.
- Reduced sampling of genes also had limited impact on errors in estimated dates.
- In contrast, reducing the number of fossil calibrations led to dramatic errors in estimated dates.
Time-calibrated phylogenies have become essential to evolutionary biology. A recurrent and unresolved question for dating analyses is whether genes with missing data cells should be included or excluded. This issue is particularly unclear for the most widely used dating method, the uncorrelated lognormal approach implemented in BEAST. Here, we test the robustness of this method to missing data. We compare divergence-time estimates from a nearly complete dataset (20 nuclear genes for 32 species of squamate reptiles) to those from subsampled matrices, including those with 5 or 2 complete loci only and those with 5 or 8 incomplete loci added. In general, missing data had little impact on estimated dates (mean error of â¼5Â Myr per node or less, given an overall age of â¼220Â Myr in squamates), even when 80% of sampled genes had 75% missing data. Mean errors were somewhat higher when all genes were 75% incomplete (â¼17Â Myr). However, errors increased dramatically when only 2 of 9 fossil calibration points were included (â¼40Â Myr), regardless of missing data. Overall, missing data (and even numbers of genes sampled) may have only minor impacts on the accuracy of divergence dating with BEAST, relative to the dramatic effects of fossil calibrations.
Journal: Molecular Phylogenetics and Evolution - Volume 85, April 2015, Pages 41-49