Article code: 4496033
Journal code: 1623834
Publication year: 2015
English article: 7 pages, PDF
Full-text version: free download
English title of the ISI article
Analysis of the multi-copied genes and the impact of the redundant protein coding sequences on gene annotation in prokaryotic genomes
Persian translation of the title
تجزیه و تحلیل ژن های چند کپی شده و تاثیر توالی کدگذاری پروتئین از کار بر روی حاشیه نویسی ژن در ژنوم پروکریوت
Keywords
Redundant protein coding sequence, multi-copied gene, prokaryotic genome
Related topics
Life Sciences and Biotechnology, Agricultural and Biological Sciences, Agricultural and Biological Sciences (General)
English abstract
The important roles of duplicated genes in evolutionary processes have been recognized in bacteria, archaebacteria and eukaryotes, but multi-copied protein coding genes that share 100% sequence identity have received very little study. In this paper, the multi-copied protein coding genes in a number of prokaryotic genomes are first comprehensively analyzed. The results show that 0-15.93% of the protein coding genes in each genome are multi-copied genes and 0-16.49% of the protein coding genes in each genome are highly similar, with sequence identity ≥80%. Function and COG (Clusters of Orthologous Groups of proteins) analysis shows that 64.64% of the multi-copied genes have transposase function and 86.28% of the COG-assigned multi-copied genes fall under COG code 'L'. Furthermore, the impact of redundant protein coding sequences on gene prediction results is studied. The results show that protein coding sequence redundancy cannot be ignored, and that the consistency of the gene annotation results before and after excluding the redundant sequences is negatively correlated with the degree of redundancy of the protein coding sequences in the training set.
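To make the central quantity concrete, here is a minimal sketch of how the share of multi-copied genes in one genome could be counted and how a non-redundant training set could be derived. It is not the authors' method. It assumes that "100% sequence identity" means an exact, full-length, case-insensitive match between coding sequences, and the function names and toy data are hypothetical.

```python
from collections import defaultdict


def multi_copied_groups(cds_by_gene):
    """Return lists of gene IDs whose coding sequences are exactly identical.

    Assumption: identity is an exact full-length string match, ignoring case.
    """
    groups = defaultdict(list)
    for gene_id, seq in cds_by_gene.items():
        groups[seq.upper()].append(gene_id)
    # Only sequences that occur more than once count as multi-copied.
    return [ids for ids in groups.values() if len(ids) > 1]


def multi_copied_percentage(cds_by_gene):
    """Percentage of genes in the genome that belong to a multi-copied group."""
    if not cds_by_gene:
        return 0.0
    n_multi = sum(len(ids) for ids in multi_copied_groups(cds_by_gene))
    return 100.0 * n_multi / len(cds_by_gene)


def remove_redundant(cds_by_gene):
    """Keep the first gene seen for each distinct sequence."""
    seen = set()
    kept = {}
    for gene_id, seq in cds_by_gene.items():
        key = seq.upper()
        if key not in seen:
            seen.add(key)
            kept[gene_id] = seq
    return kept


# Hypothetical toy genome: geneA and geneB are identical copies.
toy = {
    "geneA": "ATGAAATAA",
    "geneB": "ATGAAATAA",
    "geneC": "ATGCCCTAA",
    "geneD": "ATGGGGTAA",
}

print(multi_copied_groups(toy))
print(multi_copied_percentage(toy))
print(list(remove_redundant(toy)))
```

Detecting the "highly similar" genes at ≥80% identity would require pairwise alignment rather than exact matching, which this sketch does not attempt.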
Publisher
Database: Elsevier - ScienceDirect
Journal: Journal of Theoretical Biology - Volume 376, 7 July 2015, Pages 8-14