Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
505731 | Computers in Biology and Medicine | 2009 | 14 Pages |
We have developed an integrated tool for statistical analysis of large-scale LC-MS profiles of complex protein mixtures comprising a set of procedures for data processing, selection of biomarkers used in early diagnostic and classification of patients based on their peptide mass fingerprints.Here, a novel boosting technique is proposed, which is embedded in our framework for MS data analysis. Our boosting scheme is based on Hannan-consistent game playing strategies. We analyze boosting from a game-theoretic perspective and define a new class of boosting algorithms called H-boosting methods.In the experimental part of this work we apply the new classifier together with classical and state-of-the-art algorithms to classify ovarian cancer and cystic fibrosis patients based on peptide mass spectra.The methods developed here provide automatic, general, and efficient means for processing of large scale LC-MS datasets. Good classification results suggest that our approach is able to uncover valuable information to support medical diagnosis.