کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6026202 1580900 2015 22 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Comparison of variants of canonical correlation analysis and partial least squares for combined analysis of MRI and genetic data
موضوعات مرتبط
علوم زیستی و بیوفناوری علم عصب شناسی علوم اعصاب شناختی
پیش نمایش صفحه اول مقاله
Comparison of variants of canonical correlation analysis and partial least squares for combined analysis of MRI and genetic data
چکیده انگلیسی
The standard analysis approach in neuroimaging genetics studies is the mass-univariate linear modeling (MULM) approach. From a statistical view, however, this approach is disadvantageous, as it is computationally intensive, cannot account for complex multivariate relationships, and has to be corrected for multiple testing. In contrast, multivariate methods offer the opportunity to include combined information from multiple variants to discover meaningful associations between genetic and brain imaging data. We assessed three multivariate techniques, partial least squares correlation (PLSC), sparse canonical correlation analysis (sparse CCA) and Bayesian inter-battery factor analysis (Bayesian IBFA), with respect to their ability to detect multivariate genotype-phenotype associations. Our goal was to systematically compare these three approaches with respect to their performance and to assess their suitability for high-dimensional and multi-collinearly dependent data as is the case in neuroimaging genetics studies. In a series of simulations using both linearly independent and multi-collinear data, we show that sparse CCA and PLSC are suitable even for very high-dimensional collinear imaging data sets. Among those two, the predictive power was higher for sparse CCA when voxel numbers were below 400 times sample size and candidate SNPs were considered. Accordingly, we recommend Sparse CCA for candidate phenotype, candidate SNP studies. When voxel numbers exceeded 500 times sample size, the predictive power was the highest for PLSC. Therefore, PLSC can be considered a promising technique for multivariate modeling of high-dimensional brain-SNP-associations. In contrast, Bayesian IBFA cannot be recommended, since additional post-processing steps were necessary to detect causal relations. To verify the applicability of sparse CCA and PLSC, we applied them to an experimental imaging genetics data set provided for us. Most importantly, application of both methods replicated the findings of this data set.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: NeuroImage - Volume 107, 15 February 2015, Pages 289-310
نویسندگان
, , , , , , ,