Article ID Journal Published Year Pages File Type
405274 Knowledge-Based Systems 2012 10 Pages PDF
Abstract

We report the discovery of strong correlations between protein coding regions and the prediction errors when using the simple recurrent network to segment genome sequences. We are going to use SARS genome to demonstrate how we conduct training and derive corresponding results. The distribution of prediction error indicates how the underlying hidden regularity of the genome sequences and the results are consistent with the finding of biologists: predicated protein coding features of SARS genome. This implies that the simple recurrent network is capable of providing new features for further biological studies when applied on genome studies. The HA gene of influenza A subtype H1N1 is also analyzed in a similar way.

Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , ,