Article ID Journal Published Year Pages File Type
7562622 Chemometrics and Intelligent Laboratory Systems 2016 13 Pages PDF
Abstract
The integration of multiblock high throughput data from multiple sources is one of the major challenges in several disciplines including metabolomics, computational biology, genomics, and clinical psychology. A main challenge in this line of research is to obtain interpretable results 1) that give an insight into the common and distinctive sources of variations associated to the multiple and heterogeneous data blocks and 2) that facilitate the identification of relevant variables. We present a novel variable selection method for performing data integration, providing easily interpretable results, and recovering underlying data structure such as common and distinctive components. The flexibility and applicability of this method are showcased via numerical simulations and an application to metabolomics data.
Related Topics
Physical Sciences and Engineering Chemistry Analytical Chemistry
Authors
, ,