کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
242279 501817 2007 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
MICF: An effective sanitization algorithm for hiding sensitive patterns on data mining
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
پیش نمایش صفحه اول مقاله
MICF: An effective sanitization algorithm for hiding sensitive patterns on data mining
چکیده انگلیسی

Data mining mechanisms have widely been applied in various businesses and manufacturing companies across many industry sectors. Sharing data or sharing mined rules has become a trend among business partnerships, as it is perceived to be a mutually benefit way of increasing productivity for all parties involved. Nevertheless, this has also increased the risk of unexpected information leaks when releasing data. To conceal restrictive itemsets (patterns) contained in the source database, a sanitization process transforms the source database into a released database that the counterpart cannot extract sensitive rules from. The transformed result also conceals non-restrictive information as an unwanted event, called a side effect or the “misses cost”. The problem of finding an optimal sanitization method, which conceals all restrictive itemsets but minimizes the misses cost, is NP-hard. To address this challenging problem, this study proposes the maximum item conflict first (MICF) algorithm. Experimental results demonstrate that the proposed method is effective, has a low sanitization rate, and can generally achieve a significantly lower misses cost than those achieved by the MinFIA, MaxFIA, IGA and Algo2b methods in several real and artificial datasets.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Advanced Engineering Informatics - Volume 21, Issue 3, July 2007, Pages 269–280
نویسندگان
, , ,