Building semantic scene models from unconstrained video

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
527865	869391	2012	11 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Scene understanding - درک صحنه Human behaviour - رفتار انسان Machine learning - یادگیری ماشین

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو

پیش نمایش صفحه اول مقاله

Building semantic scene models from unconstrained video

چکیده انگلیسی

This paper describes a method for building semantic scene models from video data using observed motion. We do this through unsupervised clustering of simple yet novel motion descriptors, which provide a quantized representation of gross motion within scene regions. Using these we can characterise the dominant patterns of motion, and then group spatial regions based upon both proximity and local motion similarity to define areas or regions with particular motion characteristics. We are able to process scenes in which objects are difficult to detect and track due to variable frame-rate, video quality or occlusion, and we are able to identify regions which differ by usage but which do not differ by appearance (such as frequently used paths across open space). We demonstrate our method on 50 videos from very different scene types: indoor scenarios with unpredictable unconstrained motion, junction scenes, road and path scenes, and open squares or plazas. We show that these scenes can be clustered using our representation, and that the incorporation of learned spatial relations into the representation enables us to cluster more effectively. This method enables us to make meaningful statements about video scenes as a whole (such as “this video is like that video”) and about regions within these scenes (such as “this part of this scene is similar to that part of that scene”).

► We use simple tracked features to build a model of unconstrained scenes.
► This works with poor quality and low-frame-rate video.
► From this we learn spatial relationships.
► Scenes can then be clustered, and similarities between scenes identified.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Vision and Image Understanding - Volume 116, Issue 3, March 2012, Pages 446–456

نویسندگان

Hannah M. Dee, Anthony G. Cohn, David C. Hogg,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Building semantic scene models from unconstrained video

دسترسی سریع

ارتباط

English Website