کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4970171 1450031 2017 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Feature weighting and selection with a Pareto-optimal trade-off between relevancy and redundancy
ترجمه فارسی عنوان
مقیاس و انتخاب ویژگی ها با یک معامله مطلوب پارتو بین ربط و اضافه کاری
کلمات کلیدی
انتخاب ویژگی، ویژگی وزن، بهینه سازی چند هدفه، اندازه گیری اطلاعات، طبقه بندی،،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
چکیده انگلیسی
Feature Selection (FS) is an important pre-processing step in machine learning and it reduces the number of features/variables used to describe each member of a dataset. Such reduction occurs by eliminating some of the non-discriminating and redundant features and selecting a subset of the existing features with higher discriminating power among various classes in the data. In this paper, we formulate the feature selection as a bi-objective optimization problem of some real-valued weights corresponding to each feature. A subset of the weighted features is thus selected as the best subset for subsequent classification of the data. Two information theoretic measures, known as 'relevancy' and 'redundancy' are chosen for designing the objective functions for a very competitive Multi-Objective Optimization (MOO) algorithm called 'Multi-Objective Evolutionary Algorithm based on Decomposition (MOEA/D)'. We experimentally determine the best possible constraints on the weights to be optimized. We evaluate the proposed bi-objective feature selection and weighting framework on a set of 15 standard datasets by using the popular k-Nearest Neighbor (k-NN) classifier. As is evident from the experimental results, our method appears to be quite competitive to some of the state-of-the-art FS methods of current interest. We further demonstrate the effectiveness of our framework by changing the choices of the optimization scheme and the classifier to Non-dominated Sorting Genetic Algorithm (NSGA)-II and Support Vector Machines (SVMs) respectively.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 88, 1 March 2017, Pages 12-19
نویسندگان
, ,