کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
533879 870180 2014 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A fully connected model for consistent collective activity recognition in videos
ترجمه فارسی عنوان
یک مدل به طور کامل متصل برای به رسمیت شناختن فعالیت های مشترک در فیلم ها
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
چکیده انگلیسی


• A fully connected model for consistent collective activity recognition is proposed.
• Various features are leveraged over a multi-scale in a single unified model.
• Efficiency is achieved by describing the pairwise potentials as Gaussian kernels.
• Our model can deal with multiple activities in a scene and activity transition.
• Experimental results show that our method outperforms state-of-the art methods.

We propose a novel method for consistent collective activity recognition in video images. Collective activities are activities performed by multiple persons, such as queuing in a line, talking together, and waiting at an intersection. Since it is often difficult to differentiate between these activities using the appearance of only an individual person, the models proposed in recent studies exploit the contextual information of other people nearby. However, these models do not sufficiently consider the spatial and temporal consistency in a group (e.g., they consider the consistency in only the adjacent area), and therefore, they cannot effectively deal with temporary misclassification or simultaneously consider multiple collective activities in a scene. To overcome this drawback, this paper describes a method to integrate the individual recognition results via fully connected conditional random fields (CRFs), which consider all the interactions among the people in a video clip and alter the interaction strength in accordance with the degree of their similarity. Unlike previous methods that restrict the interactions among the people heuristically (e.g., within a constant area), our method describes the “multi-scale” interactions in various features, i.e., position, size, motion, and time sequence, in order to allow various types, sizes, and shapes of groups to be treated. Experimental results on two challenging video datasets indicate that our model outperforms not only other graph topologies but also state-of-the art models.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 43, 1 July 2014, Pages 109–118
نویسندگان
, , , , ,