Article ID Journal Published Year Pages File Type
4948632 Neurocomputing 2016 8 Pages PDF
Abstract
The bag-of-visual-words (BoW) model has been broadly applied to represent images in a variety of image retrieval applications. Ideal visual word vocabulary should obtain two important properties, completeness and distinctiveness (i.e., containing all the similar image patches and eliminating different ones accurately). However, the traditional definition of basic visual elements tend to lack in either generality or particularity, which will lead to the incoherence problem and further ruin the image retrieval performance. To address this issue, in this paper, we propose a novel basic visual element called Visual Homograph - groups of local patches which have indivisible descriptors but differ in origins. To generate Visual Homograph based vocabulary, we also propose a novel similarity metric named Integrated Perceptual Similarity which systematically combines center-based and boundary-based measures. Experimental results on standard datasets demonstrate the superiority of Visual Homograph based method over state-of-the-art visual vocabulary generation approaches, in terms of alleviating incoherence problem, and enhancing retrieval performance in practical applications.
Keywords
Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, ,