Article ID Journal Published Year Pages File Type
15202 Computational Biology and Chemistry 2012 12 Pages PDF
Abstract

Classification analysis has been developed continuously since 1936. This research field has advanced as a result of development of classifiers such as KNN, ANN, and SVM, as well as through data preprocessing areas. Feature (gene) selection is required for very high dimensional data such as microarray before classification work. The goal of feature selection is to choose a subset of informative features that reduces processing time and provides higher classification accuracy. In this study, we devised a method of artificial gene making (AGM) for microarray data to improve classification accuracy. Our artificial gene was derived from a whole microarray dataset, and combined with a result of gene selection for classification analysis. We experimentally confirmed a clear improvement of classification accuracy after inserting artificial gene. Our artificial gene worked well for popular feature (gene) selection algorithms and classifiers. The proposed approach can be applied to any type of high dimensional dataset.

Graphical abstractFigure optionsDownload full-size imageDownload as PowerPoint slideHighlights► Microarray datasets require gene selection before classification task. ► We add an artificial gene into gene selection result. ► The artificial gene brings clear improvement of classification accuracy.

Related Topics
Physical Sciences and Engineering Chemical Engineering Bioengineering
Authors
, ,