Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
8646309 | Genomics | 2018 | 7 Pages |
Abstract
Accurate data imputation requires ethnicity-matched reference panels. However, recent admixtures have created mosaic human genomes, different chromosomal segments have different ethnic backgrounds, so it is impossible for a single-ethnicity reference panel to be the matched for data imputation. In this study, we explored a novel strategy for imputation. We created individualized mosaic reference panel for each person according to his/her ethnic backgrounds at each genomic locus. We examined on datasets with 70% missing values on haplotypes and 50% missing values on genotypes. Results showed that the imputation with mosaic references steadily yielded high imputation accuracy and outperforms the other strategies. With the mosaic reference panels, the imputation accuracy was 98.8 ± 0.1% (CEU), 98.7 ± 0.1% (YRI), 98.5 ± 0.1% (CHB), 98.6 ± 0.1% (ASW), 97.3 ± 0.1% (MKK) and 98.2 ± 0.1% (MXL). Mosaic reference panel will be one option for future missing value imputation in big data era.
Related Topics
Life Sciences
Biochemistry, Genetics and Molecular Biology
Genetics
Authors
Qing Song, Wei Xu, Wenzhi Li, Shaohua He, Jiankang Liu, Guangming Wang, Li Ma,