کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
562491 1451955 2015 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Multi-view discriminative and structured dictionary learning with group sparsity for human action recognition
ترجمه فارسی عنوان
یادگیری فرهنگی با استفاده از چندین دیدگاه و با ساختار گروهی برای شناخت عمل انسان
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی


• Multi-view bag of words (MVBoW) representation is employed.
• The graph model is also utilized to fuse different views.
• We formulate the multi-view action recognition task as a joint dictionary learning (DL) problem.
• We release a new multi-view action dataset which contains RGB, depth and skeleton data.

Human action may be observed from multi-view, which are highly related but sometimes look different from each other. Traditional metric learning algorithms have achieved satisfactory performance in single-view, but they often fail or do not satisfy when they are utilized to fuse different views. Thus, multi-view discriminative and structured dictionary learning with group sparsity and graph model (GM-GS-DSDL) is proposed to fuse different views and recognize human actions. First, spatio-temporal interest points are extracted for each view, and then multi-view bag of words (MVBoW) representation is employed, at the same time, the graph model is also utilized to fuse different views, which will remove overlapped interest points to explore their consistency properties. Furthermore, GM-GS-DSDL is formulated to discover the latent correlation among multiple views. In addition, we also issue a new multi-view action dataset with RGB, depth and skeleton data (called CVS-MV-RGBD). Large-scale experimental results on multi-view IXMAX and CVS-MV-RGBD datasets show that the exploring of consistency properties of different views by graph model is very useful, moreover, GM-GS-DSDL for each view, which are learnt simultaneously, can further improve the fusion performance. Comparative experiments demonstrate that our proposed algorithm can obtain competing performance against the state-of-the-art methods.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Signal Processing - Volume 112, July 2015, Pages 83–97
نویسندگان
, , , , ,