کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
494978 862810 2015 16 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Conception of a dominance-based multi-objective local search in the context of classification rule mining in large and imbalanced data sets
ترجمه فارسی عنوان
مفهوم جستجوی محلی چند هدفه مبتنی بر سلطه در زمینه قوانین استخراج قوانین در مجموعه داده های بزرگ و عدم تعادل
کلمات کلیدی
طبقه بندی جزئی، داده های نامتعادل، چند هدفه، جستجوی محلی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


• Formulation of the classification rule mining problem as a multi-objective problem.
• Proposal of MOCA-I that deals both with uncertainty, class imbalance and volumetry.
• Comparison of different MOCA-I based DMLS versions and DMLS 1· * shows better results.
• Comparison with 13 state-of-the-art classification algorithms.
• MOCA-I gives shorter and statistically more effective rules than other algorithms.

Classification on medical data raises several problems such as class imbalance, double meaning of missing data, volumetry or need of highly interpretable results. In this paper a new algorithm is proposed: MOCA-I (Multi-Objective Classification Algorithm for Imbalanced data), a multi-objective local search algorithm that is conceived to deal with these issues all together. It is based on a new modelization as a Pittsburgh multi-objective partial classification rule mining problem, which is described in the first part of this paper. An existing dominance-based multi-objective local search (DMLS) is modified to deal with this modelization. After experimentally tuning the parameters of MOCA-I and determining which version of DMLS algorithm is the most effective, the obtained MOCA-I version is compared to several state-of-the-art classification algorithms. This comparison is realized on 10 small and middle-sized data sets of literature and 2 real data sets; MOCA-I obtains the best results on the 10 data sets and is statistically better than other approaches on the real data sets.

Figure optionsDownload as PowerPoint slide

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Applied Soft Computing - Volume 34, September 2015, Pages 705–720
نویسندگان
, , , , ,