کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
535547 870353 2013 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Exploring STIP-based models for recognizing human interactions in TV videos
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Exploring STIP-based models for recognizing human interactions in TV videos
چکیده انگلیسی

Human motion recognition – action (HAR) or interaction (HIR) – in real video data is identified as a very challenging task. In the last few years models of increasing complexity have been proposed in order to improve the performance in the task. However, it still remains unclear whether it is the features or the models what deserves the increase in complexity. In this paper an evaluation of such problem is carried out in the HIR task. For that purpose, we compare the results obtained in our experiments – by using STIP-based features and BOW models as basis and combined with a standard classifier – with some of the more effective and recent approaches that use alternative representation models. We perform a comprehensive experimental study on two state-of-the-art databases in HIR: TV Human interactions and UT-interactions. We compare the results of our experiments with recent results published on these datasets. In addition, we run cross-data experiments on Hollywood-2 dataset in order to study the capability of generalization of the trained models through different datasets. The most relevant result is that the model combining STIP + BOW is competitive in the HIR task in comparison with the most complex ones. It is also shown that the vocabulary learning subtask can be improved by using compression algorithms on large enough initial set of features. In contrast to other categorization tasks the context does not help, the results show that dense sampling of STIP is the best choice, but only when it is used inside the region of interest of the interaction.


► Comprehensive study of STIP-based models for human interaction recognition in video.
► New results and conclusions on state-of-the-art datasets for human interactions.
► Additional cross-data results of human interactions from Hollywood-2 dataset.
► This new work has not been previously submitted to any conference or journal.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 34, Issue 15, 1 November 2013, Pages 1819–1828
نویسندگان
, , ,