کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
527886 869405 2011 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Towards a more discriminative and semantic visual vocabulary
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Towards a more discriminative and semantic visual vocabulary
چکیده انگلیسی

We present a novel method for constructing a visual vocabulary that takes into account the class labels of images, thus resulting in better recognition performance and more efficient learning. Our method consists of two stages: Cluster Precision Maximisation (CPM) and Adaptive Refinement. In the first stage, a Reciprocal Nearest Neighbours (RNN) clustering algorithm is guided towards class representative visual words by maximising a new cluster precision criterion. As we are able to optimise the vocabulary without the need for expensive cross-validation, the overall training time is significantly reduced without a negative impact on the results. Next, an adaptive threshold refinement scheme is proposed with the aim of increasing vocabulary compactness while at the same time improving the recognition rate and further increasing the representativeness of the visual words for category-level object recognition. This is a correlation clustering based approach, which works as a meta-clustering and optimises the cut-off threshold for each cluster separately. In the experiments we analyse the recognition rate of different vocabularies for a subset of the Caltech 101 dataset, showing how RNN in combination with CPM selects the optimal codebooks, and how the clustering refinement step succeeds in further increasing the recognition rate.

Research highlights
► We present a novel method for constructing more semantic visual vocabularies.
► A procedure to optimise a novel measure for cluster precision is detailed.
► Correlation clustering techniques are used for improving class representativeness.
► Experiments confirm that our methods select optimal codebooks.
► The average recognition rate increases while the codebook size decreases.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Vision and Image Understanding - Volume 115, Issue 3, March 2011, Pages 415–425
نویسندگان
, , , ,