کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
2819210 1569909 2008 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Classification analysis of triplet periodicity in protein-coding regions of genes
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی ژنتیک
پیش نمایش صفحه اول مقاله
Classification analysis of triplet periodicity in protein-coding regions of genes
چکیده انگلیسی

We introduce a new concept of triplet periodicity class (TPC) and a measure of similarity between such classes. We performed classification of 472 288 triplet periodicity (TP) regions found in 578 868 genes from 29th release of KEGG databank. Totally 2520 classes were obtained. They contain 94% of 472 288 found cases of TP. For 92% of TP regions contained in classes the same linkage of TP to open reading frame (ORF) is observed. For 8% of TP cases we revealed a shift between ORF of a gene and ORF common for majority of genes contained in a TPC. For these 8% of periodic regions the hypothetical amino acid sequences corresponding to ORF built by TPC were made. BLAST program has shown that 2679 hypothetical amino acid sequences have statistically significant similarity with proteins from UniProt databank. We suppose that 8% of TP regions contained in classes possess a mutation originating from ORF shift. Obtained TPCs can be used for identification of genes' coding regions as well as for searching for mutations arisen arising from ORF shift.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Gene - Volume 421, Issues 1–2, 15 September 2008, Pages 52–60
نویسندگان
, ,