کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
2819210 | 1569909 | 2008 | 9 صفحه PDF | دانلود رایگان |

We introduce a new concept of triplet periodicity class (TPC) and a measure of similarity between such classes. We performed classification of 472 288 triplet periodicity (TP) regions found in 578 868 genes from 29th release of KEGG databank. Totally 2520 classes were obtained. They contain 94% of 472 288 found cases of TP. For 92% of TP regions contained in classes the same linkage of TP to open reading frame (ORF) is observed. For 8% of TP cases we revealed a shift between ORF of a gene and ORF common for majority of genes contained in a TPC. For these 8% of periodic regions the hypothetical amino acid sequences corresponding to ORF built by TPC were made. BLAST program has shown that 2679 hypothetical amino acid sequences have statistically significant similarity with proteins from UniProt databank. We suppose that 8% of TP regions contained in classes possess a mutation originating from ORF shift. Obtained TPCs can be used for identification of genes' coding regions as well as for searching for mutations arisen arising from ORF shift.
Journal: Gene - Volume 421, Issues 1–2, 15 September 2008, Pages 52–60