کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
391637 661904 2014 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Disclosed: An efficient depth-first, top-down algorithm for mining disjunctive closed itemsets in high-dimensional data
ترجمه فارسی عنوان
اکتشاف: یک الگوریتم عمیق عمیق، اول، بالا به پایین برای مجموعه داده های بسته بندی معدنی در داده های با ابعاد بزرگ
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی

We focus, in this paper, on the computational challenges of identifying disjunctive Boolean patterns in high-dimensional data. We conduct our analysis focusing particularly in microarray gene expression data, since this is one of the most stereotypical examples of high-dimensional data. We devised a novel algorithm that takes advantage of the scarcity of samples in microarray data sets, allowing us to efficiently find disjunctive closed patterns. Our algorithm, Disclosed, mines disjunctive closed itemsets by exploring the search space in a depth-first, top-down manner.We evaluated the performance of our algorithm to execute such a task using real microarray gene expression data sets publicly available on the Internet. Our experiments revealed under what situations, the characteristics of a data set, our method obtain a good, bad or average performance. We also compared the performance of our method with the state of the art algorithms for finding disjunctive closed patterns and disjunctive minimal generators. We observed that our approach is two orders of magnitude more efficient, both in terms of time and memory.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 280, 1 October 2014, Pages 171–187
نویسندگان
, ,