کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
526742 869216 2016 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Action recognition using saliency learned from recorded human gaze *
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Action recognition using saliency learned from recorded human gaze *
چکیده انگلیسی


• 3D CNN action features learned using gaze information outperform handcrafted features.
• Using latent variables that localize the action in an SVM framework is beneficial.
• Using saliency learned from gaze in a latent SVM framework is beneficial.

This paper addresses the problem of recognition and localization of actions in image sequences, by utilizing, in the training phase only, gaze tracking data of people watching videos depicting the actions in question. First, we learn discriminative action features at the areas of gaze fixation and train a Convolutional Network that predicts areas of fixation (i.e. salient regions) from raw image data. Second, we propose a Support Vector Machine-based recognition method for joint recognition and localization, in which the bounding box of the action in question is considered as a latent variable. In our formulation the optimization attempts to both minimize the classification cost and maximize the saliency within the bounding box. We show that the results obtained with the optimization where saliency within the bounding box is maximized outperform the results obtained when saliency within the bounding box is not maximized, i.e. when only classification cost is minimized. Furthermore, the results that we obtain outperform the state-of-the-art results on the UCF sports dataset.

Graphical AbstractFigure optionsDownload high-quality image (244 K)Download as PowerPoint slide

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Image and Vision Computing - Volume 52, August 2016, Pages 195–205
نویسندگان
, ,