Article ID Journal Published Year Pages File Type
2820709 Genomics 2011 8 Pages PDF
Abstract

Knowledge of the detailed organization of nucleosomes across genomes and the mechanisms of nucleosome positioning is critical for the understanding of gene regulation and expression. In the present work, the bias of 4-mer frequency in nucleosome and linker sequences of the S. cerevisiae genome was analyzed statistically. A novel position-correlation scoring function algorithm based on the bias of 4-mer frequency in linker sequences was presented to distinguish nucleosome vs linker sequences. Five-fold cross-validation demonstrated that the algorithm achieved a good performance with mean area under the receiver operator characteristics curve of 0.981. Next, the algorithm was used to predict nucleosome occupancy throughout the S. cerevisiae genome and relatively high correlation coefficients with experiment maps of nucleosome positioning were obtained. Besides, the distinct nucleosome depleted regions in the vicinity of regulatory sites were confirmed. The results suggest that intrinsic DNA sequence preferences in linker regions have a significant impact on the nucleosome occupancy.

Related Topics
Life Sciences Biochemistry, Genetics and Molecular Biology Genetics
Authors
, , ,