کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
10825804 1064674 2014 6 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Proteome compression via protein domain compositions
ترجمه فارسی عنوان
فشرده سازی پروتئین از طریق ترکیبات پروتئین
کلمات کلیدی
فشرده سازی مبتنی بر گرامر، ترکیب دامنه پروتئین، برنامه ریزی خطی عدد صحیح،
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی زیست شیمی
چکیده انگلیسی
In this paper, we study domain compositions of proteins via compression of whole proteins in an organism for the sake of obtaining the entropy that the individual contains. We suppose that a protein is a multiset of domains. Since gene duplication and fusion have occurred through evolutionary processes, the same domains and the same compositions of domains appear in multiple proteins, which enables us to compress a proteome by using references to proteins for duplicated and fused proteins. Such a network with references to at most two proteins is modeled as a directed hypergraph. We propose a heuristic approach by combining the Edmonds algorithm and an integer linear programming, and apply our procedure to 14 proteomes of Dictyostelium discoideum, Escherichia coli, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Caenorhabditis elegans, Drosophila melanogaster, Arabidopsis thaliana, Oryza sativa, Danio rerio, Xenopus laevis, Gallus gallus, Mus musculus, Pan troglodytes, and Homo sapiens. The compressed size using both of duplication and fusion was smaller than that using only duplication, which suggests the importance of fusion events in evolution of a proteome.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Methods - Volume 67, Issue 3, 1 June 2014, Pages 380-385
نویسندگان
, , ,