Identifying optimal incomplete phylogenetic data sets from sequence databases

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
9143121	1164377	2005	8 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Supermatrix Missing data - داده های گم شده

موضوعات مرتبط

علوم زیستی و بیوفناوری علوم کشاورزی و بیولوژیک بوم شناسی، تکامل، رفتار و سامانه شناسی

پیش نمایش صفحه اول مقاله

Identifying optimal incomplete phylogenetic data sets from sequence databases

چکیده انگلیسی

We introduce a new method for identifying optimal incomplete data sets from large sequence databases based on the graph theoretic concept of Î±-quasi-bicliques. The quasi-biclique method searches large sequence databases to identify useful phylogenetic data sets with a specified amount of missing data while maintaining the necessary amount of overlap among genes and taxa. The utility of the quasi-biclique method is demonstrated on large simulated sequence databases and on a data set of green plant sequences from GenBank. The quasi-biclique method greatly increases the taxon and gene sampling in the data sets while adding only a limited amount of missing data. Furthermore, under the conditions of the simulation, data sets with a limited amount of missing data often produce topologies nearly as accurate as those built from complete data sets. The quasi-biclique method will be an effective tool for exploiting sequence databases for phylogenetic information and also may help identify critical sequences needed to build large phylogenetic data sets.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Molecular Phylogenetics and Evolution - Volume 35, Issue 3, June 2005, Pages 528-535

نویسندگان

Changhui Yan, J. Gordon Burleigh, Oliver Eulenstein,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Identifying optimal incomplete phylogenetic data sets from sequence databases

دسترسی سریع

ارتباط

English Website