Article ID Journal Published Year Pages File Type
8645347 Gene 2018 23 Pages PDF
Abstract
The explosive growth in genomic data has provided novel insights into the conflicting signals hidden in phylogenetic trees. Although some studies have explored the effects of the GC content and parsimony informative sites (PIS) on the phylogenetic tree, the effect of the heterogeneity of the GC content at the first/second/third codon position on parsimony informative sites (GC1/2/3PIS) among different species and the effect of PIS on phylogenetic tree construction remain largely unexplored. Here, we used two different mammal genomic datasets to explore the patterns of GC1/2/3PIS heterogeneity and the effect of PIS on the phylogenetic tree of genes: (i) all GC1/2/3PIS have obvious heterogeneity between different mammals, and the levels of heterogeneity are GC3PIS > GC2PIS > GC1PIS; (ii) the number of PIS is positively correlated with the metrics of “good” gene tree topologies, and excluding the third codon position (C3) decreases the quality of gene trees by removing too many PIS. These results provide novel insights into the heterogeneity pattern of GC1/2/3PIS in mammals and the relationship between GC3/PIS and gene trees. Additionally, it is necessary to carefully consider whether to exclude C3 to improve the quality of gene trees, especially in the super-tree method.
Related Topics
Life Sciences Biochemistry, Genetics and Molecular Biology Genetics
Authors
, , , ,