Article ID Journal Published Year Pages File Type
410384 Neurocomputing 2010 9 Pages PDF
Abstract

Association rules have been widely used in gene expression data analysis. However, there is no systematical way to select interesting rules from the millions of rules generated from high dimensional gene expression data. In this study, a kernel density estimation based measurement is proposed to evaluate the interestingness of the association rules. Several pruning strategies are also devised to efficiently discover the approximate top-k interesting patterns. Finally, over-fitting problem of the classification model is addressed by using conditional independence test to eliminate redundant rules. Experimental results show the effectiveness of the proposed interestingness measure and classification model.

Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , , ,