Article ID Journal Published Year Pages File Type
530911 Pattern Recognition 2014 16 Pages PDF
Abstract

•Spatial arrangement of visual words (WSA) for image retrieval and classification.•WSA generates vectors more compact than the traditional spatial pooling methods.•WSA outperforms Spatial Pyramids in the retrieval scenario.•WSA presents adequate performance in the classification scenario.

We present word spatial arrangement (WSA), an approach to represent the spatial arrangement of visual words under the bag-of-visual-words model. It lies in a simple idea which encodes the relative position of visual words by splitting the image space into quadrants using each detected point as origin. WSA generates compact feature vectors and is flexible for being used for image retrieval and classification, for working with hard or soft assignment, requiring no pre/post processing for spatial verification. Experiments in the retrieval scenario show the superiority of WSA in relation to Spatial Pyramids. Experiments in the classification scenario show a reasonable compromise between those methods, with Spatial Pyramids generating larger feature vectors, while WSA provides adequate performance with much more compact features. As WSA encodes only the spatial information of visual words and not their frequency of occurrence, the results indicate the importance of such information for visual categorization.

Related Topics
Physical Sciences and Engineering Computer Science Computer Vision and Pattern Recognition
Authors
, , , , ,