Article ID Journal Published Year Pages File Type
2787981 Journal of Genetics and Genomics 2008 5 Pages PDF
Abstract

In this study, we propose to use the principal component analysis (PCA) and regression model to incorporate linkage disequilibrium (LD) in genomic association data analysis. To accommodate LD in genomic data and reduce multiple testing, we suggest performing PCA and extracting the PCA score to capture the variation of genomic data, after which regression analysis is used to assess the association of the disease with the principal component score. An empirical analysis result shows that both genotype-based correlation matrix and haplotype-based LD matrix can produce similar results for PCA. Principal component score seems to be more powerful in detecting genetic association because the principal component score is quantitatively measured and may be able to capture the effect of multiple loci.

Related Topics
Life Sciences Biochemistry, Genetics and Molecular Biology Developmental Biology
Authors
, ,