Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
408638 | Neurocomputing | 2010 | 7 Pages |
Abstract
We aim to identify the minimal subset of random variables that is relevant for probabilistic classification in data sets with many variables but few instances. A principled solution to this problem is to determine the Markov boundary of the class variable. In this paper, we propose a novel constraint-based Markov boundary discovery algorithm called MBOR with the objective of improving accuracy while still remaining scalable to very high dimensional data sets and theoretically correct under the so-called faithfulness condition. We report extensive empirical experiments on synthetic data sets scaling up to tens of thousand variables.
Related Topics
Physical Sciences and Engineering
Computer Science
Artificial Intelligence
Authors
Sérgio Rodrigues de Morais, Alex Aussem,