Article ID Journal Published Year Pages File Type
6484216 Biocybernetics and Biomedical Engineering 2017 12 Pages PDF
Abstract
Non-small cell lung cancer (NSCLC) is the most common type of lung cancer; and is one of the leading causes of death in the world. Surgery combined with chemotherapy is the recommended treatment for NSCLC. Since chemotherapy is an expensive treatment for either medical staff or patients suffering from pain, this study attempts to construct an intelligent predictive model to predict the adjuvant chemotherapy (ACT) effectiveness/futileness in the patients, in order to help futile cases for unnecessary applications. There is a 2-step method: preprocessing and predicting. First a purposefully preprocessing technique: chi-square test, SVM-RFE and correlation matrix, were employed in NSCLC gene expression dataset as a novel multi-layered feature selection method to defeat the curse of dimension and detect the chemotherapy target genes from tens of thousands features, based on which the patients can be classified into two groups, with NB classifier at second step. 10-Fold cross-validation was found with accuracy of 68.93% for 2 genes, TGFA (205015_s_at) and SEMA6C (208100_x_at), which is preferable compared to earlier studies, even though more than 2 input features are employed for the prediction. According to the results found in this study, one can concludes that the multi-layered feature selection approach has increased the classification accuracy in terms of finding the fitted patient for receiving ACT by reducing the number of features and has significant power to be used in medical datasets with small train samples and large number of features.
Related Topics
Physical Sciences and Engineering Chemical Engineering Bioengineering
Authors
, ,