Article code | Journal code | Publication year | English article | Full-text version
496371 | 862857 | 2012 | 5-page PDF | Free download
English title of the ISI article
A novel algorithm applied to classify unbalanced data
Related topics
Engineering and Basic Sciences, Computer Engineering, Computer Science Software
English abstract

Unbalanced data, in which minority classes are represented by only a few samples, arise in many fields. The mean of unbalanced data is difficult to formalize, so traditional algorithms are limited in handling such data. In this paper, a novel algorithm based on analysis of variance (ANOVA), fuzzy C-means (FCM) and bacterial foraging optimization (BFO) is proposed to classify unbalanced data. ANOVA measures the difference between the means of two or more groups by partitioning the observed variance into components attributable to different explanatory variables. FCM is a fuzzy clustering method that allows one piece of data to belong to two or more clusters. Natural selection tends to eliminate animals with poor foraging strategies and favors the propagation of the genes of animals with successful foraging strategies. BFO models this mechanism of natural selection and has been applied to many problems. The proposed algorithm combines the advantages of the three methods: ANOVA selects beneficial feature subsets, FCM assigns data to clusters with certain membership degrees, and BFO can converge quickly to global optima. Microarray data of ovarian cancer and the zoo dataset are used to test the performance of the proposed algorithm. Simulation results support the proposed algorithm: its classification accuracy is higher than that of the existing approaches it is compared with.
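To make the ANOVA step concrete, the following is a minimal sketch of ranking features by their one-way ANOVA F statistic (between-class variance over within-class variance). It is an illustration only, not the paper's implementation: the function names `anova_f` and `rank_features`, the toy data, and the choice to keep the top-scoring features are all assumptions.

```python
from statistics import mean

def anova_f(values, labels):
    """One-way ANOVA F statistic of a single feature across class labels."""
    groups = {}
    for v, y in zip(values, labels):
        groups.setdefault(y, []).append(v)
    n, k = len(values), len(groups)
    if k < 2:
        raise ValueError("ANOVA needs at least two classes")
    grand = mean(values)
    # Variation of class means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups.values())
    # Variation of samples around their own class mean.
    ss_within = sum((v - mean(g)) ** 2 for g in groups.values() for v in g)
    if ss_within == 0:
        return float("inf") if ss_between > 0 else 0.0
    return (ss_between / (k - 1)) / (ss_within / (n - k))

def rank_features(X, y, top):
    """Return the indices of the `top` highest-scoring features and all scores."""
    scores = [anova_f([row[j] for row in X], y) for j in range(len(X[0]))]
    order = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return order[:top], scores

# Toy data (hypothetical): feature 0 separates the classes, feature 1 does not.
X = [[1.0, 5.0], [1.2, 3.0], [0.8, 4.0], [3.0, 4.0], [3.2, 5.0], [2.8, 3.0]]
y = [0, 0, 0, 1, 1, 1]
selected, scores = rank_features(X, y, top=1)
```

A feature whose class means differ strongly relative to the spread inside each class receives a large F value, which is why such a score can serve as a filter before clustering.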

Highlights
► The proposed algorithm combines the advantages of ANOVA, FCM and BFO.
► ANOVA has the ability to select beneficial feature subsets.
► FCM has the ability to assign data to clusters with certain membership degrees.
► BFO can converge quickly to global optima.
► Microarray data of ovarian cancer and the zoo dataset are used to test the performance of the proposed algorithm.
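The membership degrees mentioned for FCM can be illustrated with the standard fuzzy C-means updates, in which each point receives a degree of membership in every cluster and the degrees sum to one. This is a minimal sketch of textbook FCM, assuming a fuzzifier `m = 2`, a fixed number of iterations and hand-picked starting centers; it does not reproduce how the paper couples FCM with ANOVA or BFO, and all function names are hypothetical.

```python
import math

def memberships(points, centers, m=2.0):
    """Return U where U[j][i] is the membership of points[j] in cluster i."""
    U = []
    for p in points:
        d = [math.dist(p, c) for c in centers]
        if 0.0 in d:
            # The point coincides with a center: share full membership there.
            hits = d.count(0.0)
            U.append([1.0 / hits if di == 0.0 else 0.0 for di in d])
            continue
        inv = [di ** (-2.0 / (m - 1.0)) for di in d]
        total = sum(inv)
        U.append([v / total for v in inv])
    return U

def update_centers(points, U, m=2.0):
    """Recompute each center as the membership-weighted mean of the points."""
    dim = len(points[0])
    centers = []
    for i in range(len(U[0])):
        w = [row[i] ** m for row in U]
        total = sum(w)
        centers.append(
            [sum(wj * p[t] for wj, p in zip(w, points)) / total for t in range(dim)]
        )
    return centers

def fcm(points, centers, m=2.0, iters=50):
    """Alternate membership and center updates for a fixed number of steps."""
    for _ in range(iters):
        U = memberships(points, centers, m)
        centers = update_centers(points, U, m)
    return centers, memberships(points, centers, m)

# Toy one-dimensional data (hypothetical) with two obvious groups.
points = [[0.0], [0.2], [4.0], [4.2]]
centers, U = fcm(points, centers=[[1.0], [3.0]])
```

Unlike hard clustering, every row of `U` spreads one unit of membership across all clusters, so a point lying between two groups would receive intermediate degrees in both.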

Publisher
Database: Elsevier - ScienceDirect
Journal: Applied Soft Computing - Volume 12, Issue 8, August 2012, Pages 2481–2485