Article ID Journal Published Year Pages File Type
15182 Computational Biology and Chemistry 2012 14 Pages PDF
Abstract

In 1996, a common trinucleotide circular code, called X, is identified in genes of eukaryotes and prokaryotes ( Arquès and Michel, 1996). This circular code X is a set of 20 trinucleotides allowing the reading frames in genes to be retrieved locally, i.e. anywhere in genes and in particular without start codons. This reading frame retrieval needs a window length l of 12 nucleotides (l ≥ 12). With a window length strictly less than 12 nucleotides (l < 12), some words of X, called ambiguous words, are found in the shifted frames (the reading frame shifted by one or two nucleotides) preventing the reading frame in genes to be retrieved. Since 1996, these ambiguous words of X were never studied.In the first part of this paper, we identify all the ambiguous words of the common trinucleotide circular code X. With a length l varying from 1 to 11 nucleotides, the type and the occurrence number (multiplicity) of ambiguous words of X are given in each shifted frame. Maximal ambiguous words of X, words which are not factors of another ambiguous words, are also determined. Two probability definitions based on these results show that the common trinucleotide circular code X retrieves the reading frame in genes with a probability of about 90% with a window length of 6 nucleotides, and a probability of 99.9% with a window length of 9 nucleotides (100% with a window length of 12 nucleotides, by definition of a circular code).In the second part of this paper, we identify X circular code motifs (shortly X motifs) in transfer RNA and 16S ribosomal RNA: a tRNA X motif of 26 nucleotides including the anticodon stem-loop and seven 16S rRNA X motifs of length greater or equal to 15 nucleotides. Window lengths of reading frame retrieval with each trinucleotide of these X motifs are also determined. Thanks to the crystal structure 3I8G ( Jenner et al., 2010), a 3D visualization of X motifs in the ribosome shows several spatial configurations involving mRNA X motifs, A-tRNA and E-tRNA X motifs, and four 16S rRNA X motifs. Another identified 16S rRNA X motif is involved in the decoding center which recognizes the codon–anticodon helix in A-tRNA. From a code theory point of view, these identified X circular code motifs and their mathematical properties may constitute a translation code involved in retrieval, maintenance and synchronization of reading frames in genes.

Graphical abstractFigure optionsDownload full-size imageDownload as PowerPoint slideHighlights► Identification of the ambiguous words of the common trinucleotide circular code X. ► Identification of the maximal ambiguous words of the common trinucleotide circular code X. ► Probability of reading frame retrieval of the common trinucleotide circular code X. ► Identification of X circular code motifs in transfer and ribosomal RNAs. ► 3D visualization of X circular code motifs in the ribosome.

Related Topics
Physical Sciences and Engineering Chemical Engineering Bioengineering
Authors
,