Article ID Journal Published Year Pages File Type
530168 Pattern Recognition 2015 14 Pages PDF
Abstract

•A study of multi-output classification as graphical models.•An empirical comparison of existing strategies for modelling dependency among outputs.•A novel scalable approach based on a hill climbing heuristic: the classifier trellis.•An empirical cross-fold comparison with other methods.•A connection to structured output prediction and a comparison in a segmentation task.

Multi-output inference tasks, such as multi-label classification, have become increasingly important in recent years. A popular method for multi-label classification is classifier chains, in which the predictions of individual classifiers are cascaded along a chain, thus taking into account inter-label dependencies and improving the overall performance. Several varieties of classifier chain methods have been introduced, and many of them perform very competitively across a wide range of benchmark datasets. However, scalability limitations become apparent on larger datasets when modelling a fully cascaded chain. In particular, the methods׳ strategies for discovering and modelling a good chain structure constitute a major computational bottleneck. In this paper, we present the classifier trellis (CT) method for scalable multi-label classification. We compare CT with several recently proposed classifier chain methods to show that it occupies an important niche: it is highly competitive on standard multi-label problems, yet it can also scale up to thousands or even tens of thousands of labels.

Related Topics
Physical Sciences and Engineering Computer Science Computer Vision and Pattern Recognition
Authors
, , , ,