Article ID: 11028859
Journal: Expert Systems with Applications
Published Year: 2019
Pages: 32
File Type: PDF
Abstract
This research proposes a novel genetic algorithm-based online gradient boosting (GAOGB) model for incremental breast cancer (BC) prognosis. Advances in clinical information collection technologies have produced increasingly large volumes of stream data for BC research. Traditional batch learning models show two limitations in this setting: (1) reduced real-time prognosis accuracy, because they lose information about incremental changes in a patient's pathological condition over time; and (2) high redundancy, because the model must be retrained every time new data are received. Online boosting is an efficient technique for learning from data streams. However, difficulties in parameter assignment and the lack of adaptiveness of batch learning base learners can degrade the performance of typical online boosting algorithms. The main objective of this research is to propose an incremental learning model for BC survivability prediction. To equip the boosting algorithm with globally optimized parameters, a genetic algorithm (GA) is integrated into the online gradient boosting procedure at the parameter selection phase, enabling real-time optimization. To enhance adaptiveness, an adaptive linear regressor with minimal computational cost is adopted as the base learner and updated in step with the online boosting model. The proposed GAOGB model is comprehensively evaluated on the U.S. National Cancer Institute's Surveillance, Epidemiology, and End Results (SEER) program breast cancer dataset in terms of accuracy, area under the curve (AUC), sensitivity, specificity, retraining time, and variation at each iteration. Experimental results show that the proposed GAOGB model achieves statistically outstanding online learning effectiveness. It improves testing accuracy over its base learners by up to 28%, outperforms current state-of-the-art online learning methods, and approaches the performance of batch learning boosting algorithms. These results validate the impact of parameter assignment, adaptiveness, and convergence in devising practical online learning algorithms. The proposed GAOGB model demonstrates potential for practical incremental breast cancer prognosis, promising a combination of training effectiveness and efficiency.
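The abstract names three ingredients (online gradient boosting, an adaptive linear regressor as base learner, and GA-based parameter selection) without giving their details. The following is a minimal sketch of how such pieces could fit together, not the authors' implementation. Everything specific in it is an assumption made for illustration: the log-loss objective, the least-mean-squares update, the two tuned parameters (learning rate and shrinkage), the GA operators, the parameter bounds, and the synthetic two-feature stream that stands in for SEER records.

```python
import math
import random

def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

class OnlineLinearRegressor:
    """Linear base learner updated with one least-mean-squares step per sample."""
    def __init__(self, n_features, lr):
        self.w = [0.0] * (n_features + 1)  # last entry is the bias
        self.lr = lr

    def predict(self, x):
        return sum(wi * xi for wi, xi in zip(self.w, x)) + self.w[-1]

    def update(self, x, target):
        err = target - self.predict(x)
        for i, xi in enumerate(x):
            self.w[i] += self.lr * err * xi
        self.w[-1] += self.lr * err

class OnlineGradientBooster:
    """Additive model; each stage tracks the log-loss residual left by earlier stages."""
    def __init__(self, n_features, n_learners, lr, shrinkage):
        self.learners = [OnlineLinearRegressor(n_features, lr) for _ in range(n_learners)]
        self.shrinkage = shrinkage

    def predict_proba(self, x):
        return sigmoid(sum(self.shrinkage * h.predict(x) for h in self.learners))

    def update(self, x, y):
        partial = 0.0
        for h in self.learners:
            h.update(x, y - sigmoid(partial))  # negative gradient of log-loss
            partial += self.shrinkage * h.predict(x)

def prequential_accuracy(params, stream, n_features, n_learners=5):
    # Test-then-train: predict each sample before learning from it.
    lr, shrinkage = params
    model = OnlineGradientBooster(n_features, n_learners, lr, shrinkage)
    correct = 0
    for x, y in stream:
        correct += int((model.predict_proba(x) >= 0.5) == bool(y))
        model.update(x, y)
    return correct / len(stream)

BOUNDS = [(0.001, 0.3), (0.1, 1.0)]  # assumed ranges for (lr, shrinkage)

def ga_select(stream, n_features, rng, pop_size=8, generations=5):
    fitness = lambda p: prequential_accuracy(p, stream, n_features)
    pop = [tuple(rng.uniform(lo, hi) for lo, hi in BOUNDS) for _ in range(pop_size)]
    for _ in range(generations):
        parents = sorted(pop, key=fitness, reverse=True)[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            child = []
            for (lo, hi), gene_a, gene_b in zip(BOUNDS, a, b):
                gene = rng.choice((gene_a, gene_b))      # uniform crossover
                if rng.random() < 0.2:                   # Gaussian mutation
                    gene += rng.gauss(0.0, 0.1 * (hi - lo))
                child.append(min(hi, max(lo, gene)))     # clip to bounds
            children.append(tuple(child))
        pop = parents + children
    return max(pop, key=fitness)

# Synthetic, linearly separable stream (a stand-in, not clinical data).
rng = random.Random(0)
stream = []
for _ in range(300):
    x = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
    stream.append((x, 1 if x[0] + x[1] > 0 else 0))

best_params = ga_select(stream, n_features=2, rng=rng)
best_acc = prequential_accuracy(best_params, stream, n_features=2)
print("selected (lr, shrinkage):", best_params, "prequential accuracy:", best_acc)
```

In this sketch the GA scores each candidate by replaying a buffered window of the stream, so selection cost grows with window size and population size. A deployed system would have to balance that cost against how often the parameters are refreshed, which is the kind of retraining-time trade-off the evaluation in the abstract measures.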
Related Topics
Physical Sciences and Engineering > Computer Science > Artificial Intelligence