Article ID Journal Published Year Pages File Type
5907691 Genomics 2016 6 Pages PDF
Abstract

•Significant expansion of the coverage of Infinium HumanMethylation450 BeadChip in determining DNA methylation.•Performance evaluation and comparison of different computational models.•Generalizable models to expand DNA methylation measurement in diverse tissues.

The Infinium HumanMethylation450 BeadChip array, referred as 450K array hereinafter, has been widely adopted as an affordable technique to determine DNA methylation. Tens of thousands of data have been generated on diverse cell types and patient tissues, which have provided great insight into understanding the crucial roles of epigenetic modifications in many biological processes and diseases. The limitation of this technique is its coverage, which measures methylation levels of about 450,000 CpGs, accounting for about 1.6% of all CpGs in the human genome. In the present study we developed and compared computational models to significantly expand the coverage of Illumina 450K (~ 11 folds). Using the whole genome bisulfite sequencing and Illumina 450K data in the human H1 embryonic stem cell, we showed that the predicted and measured methylation levels were well correlated. Our proposed model showed superior prediction accuracies compared to the existing methods on the same dataset. When applied to predict the DNA methylome on other cells, our proposed model achieved comparable performance in cross-validations, which indicates the generalizibility of the method. Our method would thus be invaluable to maximize the usage of the existing data.

Related Topics
Life Sciences Biochemistry, Genetics and Molecular Biology Genetics