کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
385646 660869 2011 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A Boolean function approach to feature selection in consistent decision information systems
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
A Boolean function approach to feature selection in consistent decision information systems
چکیده انگلیسی

The goal of feature selection (FS) is to find the minimal subset (MS) R of condition feature set C such that R has the same classification power as C and then reduce the dataset by discarding from it all features not contained in R. Usually one dataset may have a lot of MSs and finding all of them is known as an NP-hard problem. Therefore, when only one MS is required, some heuristic for finding only one or a small number of possible MSs is used. But in this case there is a risk that the best MSs would be overlooked. When the best solution of an FS task is required, the discernibility matrix (DM)-based approach, generating all MSs, is used. There are basically two factors that often cause to overflow the computer’s memory due to which the DM-based FS programs fail. One of them is the largeness of sizes of discernibility functions (DFs) for large data sets; the other is the intractable space complexity of the conversion of a DF to disjunctive normal form (DNF). But usually most of the terms of DF and temporary results generated during DF to DNF conversion process are redundant ones. Therefore, usually the minimized DF (DFmin) and the final DNF is to be much simpler than the original DF and temporary results mentioned, respectively. Based on these facts, we developed a logic function-based feature selection method that derives DFmin from the truth table image of a dataset and converts it to DNF with preventing the occurrences of redundant terms. The proposed method requires no more amount of memory than that is required for constructing DFmin and final DNF separately. Due to this property, it can process most of datasets that can not be processed by DM-based programs.

Research highlights
► The space complexity of the discernibility function-based reduct generation process is reduced.
► To do this the minimal form of discernibility function is generated.
► This function is converted to the set of reducts by preventing the occurrence of redundant implicants.
► The spacecomplexity of the problem is reduced in a great scale.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 38, Issue 7, July 2011, Pages 8229–8239
نویسندگان
, , ,