Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
552628 | Decision Support Systems | 2014 | 11 Pages |
•clusterAOI uses better heuristics and avoids overgeneralisation than AOI.•clusterAOI has superior runtime performance (about half of that of classical AOI).•clusterAOI has 4 times interestingness and 1.5 times divergence better than AOI.•clusterAOI does not fluctuate between small and large datasets—steady and stable.
We present a hybrid heuristic algorithm, clusterAOI, that generates a more interesting generalised table than obtained via attribute-oriented induction (AOI). AOI tends to overgeneralise as it uses a fixed global static threshold to cluster and generalise attributes irrespective of their features, and does not evaluate intermediate interestingness. In contrast, clusterAOI uses attribute features to dynamically recalculate new attribute thresholds and applies heuristics to evaluate cluster quality and intermediate interestingness. Experimental results show improved interestingness, better output pattern distribution and expressiveness, and improved runtime.