Representation of image content based on RoI-BoW

Article ID	Journal	Published Year	Pages	File Type
6938537	Journal of Visual Communication and Image Representation	2015	13 Pages	PDF

Abstract

Representation of image content is an important part of image annotation and retrieval, and it has become a hot issue in computer vision. As an efficient and accurate image content representation model, bag-of-words (BoW) has attracted more attention in recent years. After segmentation, BoW treats all of the image regions equally. In fact, some regions of image are more important than others in image retrieval, such as salient object or region of interest. In this paper, a novel region of interest based bag-of-words model (RoI-BoW) for image representation is proposed. At first, the difference of Gaussian (DoG) is adopted to find key points in an image and generates different size grid as RoI to construct visual words by the BoW model. Furthermore, we analyze the influence of different size segmentation on image content representation by content based image retrieval. Experiments on Corel 5K verify the effectiveness of RoI-BoW on image content representation, and prove that RoI-BoW outperforms the BoW model significantly. Moreover, amounts of experiments illustrate the influence of different size segmentation on image representation based on the Bow model and RoI-BoW model respectively. This work is helpful to choose appropriate grid size in different situations when representing image content, and meaningful to image classification and retrieval.

Keywords

Feature extraction Image retrieval Gabor filtering region of interest Bag of words