Article ID Journal Published Year Pages File Type
4636677 Applied Mathematics and Computation 2006 17 Pages PDF
Abstract

We prove a (m/δ)O(κ) · na time bound for finding minimum solutions Smin of Feature Set problems, where n is the total size of a given Feature Set problem, κ ⩽ ∣Smin∣, m equals the number of non-target features, a is a (relatively small) constant, and 1 − δ is the confidence that the solution is of minimum length. In terms of parameterized complexity of NP-complete problems, our time bound differs from an FPT-type bound by the factor mO(κ) for fixed δ. The algorithm is applied to a prominent microarray dataset: The classification of gene-expression data related to acute myeloid leukaemia (AML) and acute lymphoblastic leukaemia (ALL). From the set of potentially significant features calculated by the algorithm we can identify three genes (D88422, M92287, L09209) that produce zero errors on the test set by using a simple, straightforward evaluation procedure (performing the test on the single gene M84526 produces only one error).

Related Topics
Physical Sciences and Engineering Mathematics Applied Mathematics
Authors
,