Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
10825796 | Methods | 2014 | 9 Pages |
Abstract
In this work, we present a mathematical formulation for integrative clustering of multiple-source data including both numerical and categorical data to resolve the above issue. Specifically, we formulate the problem as a novel consensus clustering method called Molecular Regularized Consensus Patient Stratification (MRCPS) based on an optimization process with regularization. Unlike the traditional consensus clustering methods, MRCPS can automatically and spontaneously cluster both numerical and categorical data with any option of similarity metrics. We apply this new method by applying it on the TCGA breast cancer datasets and evaluate using both statistical criteria and clinical relevance on predicting prognosis. The result demonstrates the superiority of this method in terms of effectiveness of aggregation and differentiating patient outcomes. Our method, while motivated by the breast cancer research, is nevertheless universal for integrative genomics studies.
Related Topics
Life Sciences
Biochemistry, Genetics and Molecular Biology
Biochemistry
Authors
Chao Wang, Raghu Machiraju, Kun Huang,