کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
527492 869328 2015 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Spatial histograms of soft pairwise similar patches to improve the bag-of-visual-words model
ترجمه فارسی عنوان
هیستوگرام های فضایی از تکه های مشابه جفت نرم برای بهبود مدل کیسه بصری کلمات
کلمات کلیدی
اطلاعات فضایی، طبقه بندی شی، زاویه و هیستوگرام فاصله، کلماتی از قبیل
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
چکیده انگلیسی


• A new approach to improve image representation for category level classification.
• We encode pairwise relative spatial information of patches in the bag of word model.
• A simple approach complementary to the Spatial Pyramid Representation (SPR).
• Can be combined with SPR and outperforms other existing spatial methods.
• Experimental validation of the approach is shown on 5 challenging datasets.

In the context of category level scene classification, the bag-of-visual-words model (BoVW) is widely used for image representation. This model is appearance based and does not contain any information regarding the arrangement of the visual words in the 2D image space. To overcome this problem, recent approaches try to capture information about either the absolute or the relative spatial location of visual words. In the first category, the so-called Spatial Pyramid Representation (SPR) is very popular thanks to its simplicity and good results. Alternatively, adding information about occurrences of relative spatial configurations of visual words was proven to be effective but at the cost of higher computational complexity, specifically when relative distance and angles are taken into account. In this paper, we introduce a novel way to incorporate both distance and angle information in the BoVW representation. The novelty is first to provide a computationally efficient representation adding relative spatial information between visual words and second to use a soft pairwise voting scheme based on the distance in the descriptor space. Experiments on challenging data sets MSRC-2, 15Scene, Caltech101, Caltech256 and Pascal VOC 2007 demonstrate that our method outperforms or is competitive with concurrent ones. We also show that it provides important complementary information to the spatial pyramid matching and can improve the overall performance.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Vision and Image Understanding - Volume 132, March 2015, Pages 102–112
نویسندگان
, , , ,