Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
6938537 | Journal of Visual Communication and Image Representation | 2015 | 13 Pages |
Abstract
Representation of image content is an important part of image annotation and retrieval, and it has become a hot issue in computer vision. As an efficient and accurate image content representation model, bag-of-words (BoW) has attracted more attention in recent years. After segmentation, BoW treats all of the image regions equally. In fact, some regions of image are more important than others in image retrieval, such as salient object or region of interest. In this paper, a novel region of interest based bag-of-words model (RoI-BoW) for image representation is proposed. At first, the difference of Gaussian (DoG) is adopted to find key points in an image and generates different size grid as RoI to construct visual words by the BoW model. Furthermore, we analyze the influence of different size segmentation on image content representation by content based image retrieval. Experiments on Corel 5K verify the effectiveness of RoI-BoW on image content representation, and prove that RoI-BoW outperforms the BoW model significantly. Moreover, amounts of experiments illustrate the influence of different size segmentation on image representation based on the Bow model and RoI-BoW model respectively. This work is helpful to choose appropriate grid size in different situations when representing image content, and meaningful to image classification and retrieval.
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Vision and Pattern Recognition
Authors
Jing Zhang, Da Li, Yaxin Zhao, Zhihua Chen, Yubo Yuan,