کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
527097 869286 2011 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Toward coherent object detection and scene layout understanding
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Toward coherent object detection and scene layout understanding
چکیده انگلیسی

Detecting objects in complex scenes while recovering the scene layout is a critical functionality in many vision-based applications. In this work, we advocate the importance of geometric contextual reasoning for object recognition. We start from the intuition that objects' location and pose in the 3D space are not arbitrarily distributed but rather constrained by the fact that objects must lie on one or multiple supporting surfaces. We model such supporting surfaces by means of hidden parameters (i.e. not explicitly observed) and formulate the problem of joint scene reconstruction and object recognition as the one of finding the set of parameters that maximizes the joint probability of having a number of detected objects on K supporting planes given the observations. As a key ingredient for solving this optimization problem, we have demonstrated a novel relationship between object location and pose in the image, and the scene layout parameters (i.e. normal of one or more supporting planes in 3D and camera pose, location and focal length). Using a novel probabilistic formulation and the above relationship our method has the unique ability to jointly: i) reduce false alarm and false negative object detection rate; ii) recover object location and supporting planes within the 3D camera reference system; iii) infer camera parameters (view point and the focal length) from just one single uncalibrated image. Quantitative and qualitative experimental evaluation on two datasets (desk-top dataset [1] and LabelMe [2]) demonstrates our theoretical claims.

Figure optionsDownload high-quality image (279 K)Download as PowerPoint slideHighlights
► We advocate the importance of geometric contextual reasoning for object recognition.
► Our method reduces false alarm and false negative object detection.
► Our method recovers object location and supporting planes within the 3D camera reference system.
► Our method infers camera parameters from one single uncalibrated image.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Image and Vision Computing - Volume 29, Issue 9, August 2011, Pages 569–579
نویسندگان
, , ,