کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
534478 870257 2015 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Action recognition in still images by learning spatial interest regions from videos
ترجمه فارسی عنوان
شناخت عمل در تصاویر ساکن با یادگیری مناطق مکانی از ویدیوها
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
چکیده انگلیسی


• This paper addresses the problem of human action recognition in still images.
• This paper proposes a novel approach to learn interest regions from videos.
• This paper builds a Bayesian framework using learned interest regions and image local features for classification.
• This paper achieves high recognition rates compared to conventional image classification techniques.

A common approach to human action recognition from still images consists in computing local descriptors for classification. Typically, these descriptors are computed in the vicinity of key points which either result from running a key point detector or from dense sampling of pixel coordinates. Such key points are not a priorly related to human activities and thus might not be very informative with regard to action recognition. Several recent approaches, on the other hand, are based on learning person–object interactions and saliency maps in images. In this article, we investigate the possibility and applicability of identifying action-specific points or regions of interest in still images based on information extracted from video data. In particular, we propose a novel method for extracting spatial interest regions where we apply non-negative matrix factorization to optical flow fields extracted from videos. The resulting basis flows are found to indicate image regions that are specific to certain actions and therefore allow for an informed sampling of key points for feature extraction. We thus present a generative model for action recognition in still images that allows for characterizing joint distributions of regions of interest, local image features (visual words), and human actions. Experimental evaluation shows that (a) our approach is able to extract interest regions that are highly correlated to those body parts most relevant for different actions and (b) our generative model achieves high accuracy in action classification.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 51, 1 January 2015, Pages 8–15
نویسندگان
, , ,