کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
2833737 1570801 2016 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Impact of gene family evolutionary histories on phylogenetic species tree inference by gene tree parsimony
موضوعات مرتبط
علوم زیستی و بیوفناوری علوم کشاورزی و بیولوژیک بوم شناسی، تکامل، رفتار و سامانه شناسی
پیش نمایش صفحه اول مقاله
Impact of gene family evolutionary histories on phylogenetic species tree inference by gene tree parsimony
چکیده انگلیسی


• Gene content and duplication of gene family influence species tree accuracy of GTP.
• Gene families of highest and lowest gene content have higher species tree accuracy.
• Lineage-specific expansion of a gene family reduces species tree accuracy.
• Potential weighting of the duplication cost in future GTP analyses.

Complicated history of gene duplication and loss brings challenge to molecular phylogenetic inference, especially in deep phylogenies. However, phylogenomic approaches, such as gene tree parsimony (GTP), show advantage over some other approaches in its ability to use gene families with duplications. GTP searches the ‘optimal’ species tree by minimizing the total cost of biological events such as duplications, but accuracy of GTP and phylogenetic signal in the context of different gene families with distinct histories of duplication and loss are unclear. To evaluate how different evolutionary properties of different gene families can impact on species tree inference, 3900 gene families from seven angiosperms encompassing a wide range of gene content, lineage-specific expansions and contractions were analyzed. It was found that the gene content and total duplication number in a gene family strongly influence species tree inference accuracy, with the highest accuracy achieved at either very low or very high gene content (or duplication number) and lowest accuracy centered in intermediate gene content (or duplication number), as the relationship can fit a binomial regression. Besides, for gene families of similar level of average gene content, those with relatively higher lineage-specific expansion or duplication rates tend to show lower accuracy. Additional correlation tests support that high accuracy for those gene families with large gene content may rely on abundant ancestral copies to provide many subtrees to resolve conflicts, whereas high accuracy for single or low copy gene families are just subject to sequence substitution per se. Very low accuracy reached by gene families of intermediate gene content or duplication number can be due to insufficient subtrees to resolve the conflicts from loss of alternative copies. As these evolutionary properties can significantly influence species tree accuracy, I discussed the potential weighting of the duplication cost by evolutionary properties of gene families in future GTP analyses.

Figure optionsDownload as PowerPoint slide

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Molecular Phylogenetics and Evolution - Volume 96, March 2016, Pages 9–16
نویسندگان
,