کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
533443 870118 2012 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Representing and recognizing objects with massive local image patches
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Representing and recognizing objects with massive local image patches
چکیده انگلیسی

Natural image patches are fundamental elements for visual pattern modeling and recognition. By studying the intrinsic manifold structures in the space of image patches, this paper proposes an approach for representing and recognizing objects with a massive number of local image patches (e.g. 17×17 pixels). Given a large collection (>104>104) of proto image patches extracted from objects, we map them into two types of manifolds with different metrics: explicit manifolds of low dimensions for structural primitives, and implicit manifolds of high dimensions for stochastic textures. We define these manifolds grown from patches as the “ε-ballsε-balls”, where εε corresponds to the perception residual or fluctuation. Using these ε-ballsε-balls as features, we present a novel generative learning algorithm by the information projection principle. This algorithm greedily stepwise pursues the object models by selecting sparse and independent ε-ballsε-balls (say 103103 for each category). During the detection and classification phase, only a small number (say 20) of features are activated by a fast KD-tree indexing technique. The proposed method owns two characters. (1) Automatically generating features (ε-ballsε-balls) from local image patches rather than designing marginal feature carefully and category-specifically. (2) Unlike the weak classifiers in the boosting models, these selected ε-ballε-ball features are used to explain object in a generative way and are mutually independent. The advantage and performance of our approach is evaluated on several challenging datasets with the task of localizing objects against appearance variance, occlusion and background clutter.


► We propose a general method to recognize objects with massive local image patches.
► We map image patches into the explicit manifold and the implicit manifold.
► We present a novel generative learning algorithm by information projection principle.
► We apply our method to object detection task and achieve very good performance.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition - Volume 45, Issue 1, January 2012, Pages 231–240
نویسندگان
, , , ,