کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4948296 1439614 2016 6 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Robust spatial-temporal deep model for multimedia event detection
ترجمه فارسی عنوان
مدل عمیق فضایی و زمانی برای تشخیص رویداد چندرسانه ای
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
The task of multimedia event detection (MED) aims at training a set of models that can automatically detect the most event-relevant videos from large datasets. In this paper, we attempt to build a robust spatial-temporal deep neural network for large-scale video event detection. In our setting, each video follows a multiple instance assumption, where its visual segments contain both spatial and temporal properties of events. Regarding these properties, we try to implement the MED system by a two-step training phase: unsupervised recurrent video reconstruction and supervised fine-tuning. We conduct extensive experiments on the challenging TRECVID MED14 dataset, which indicate that with the consideration of both spatial and temporal information, the detection performance can be further boosted compared with the state-of-the-art MED models.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Neurocomputing - Volume 213, 12 November 2016, Pages 48-53
نویسندگان
, , ,