Article code: 412188
Journal code: 679619
Publication year: 2014
English article: 10-page PDF
Full text: free download
English title of the ISI article
Clustering and retrieval of video shots based on natural stimulus fMRI
Related topics
Engineering and Basic Sciences, Computer Engineering, Artificial Intelligence
English abstract

Functional magnetic resonance imaging (fMRI) is a powerful tool for probing the human brain's perception and cognition. Besides being extensively exploited in clinical applications, fMRI techniques are also useful in everyday life. In this paper, we investigate a novel application that leverages fMRI techniques for video clustering and retrieval. In the proposed work, we integrate semantic human-centric features derived from natural stimulus fMRI data with low-level visual-audio features to facilitate video clustering and retrieval, which is a significant innovation compared to previous works that rely on either fMRI-derived features or low-level visual-audio features alone. Our system consists of several algorithmic modules. First, fMRI data are acquired while the subjects watch video shot samples. Then a newly developed brain network localization system is employed to locate the cortical regions of interest (ROIs) for each individual subject. The functional interactions are quantified by wavelet transform coherence, and the human-centric features are derived from them. Afterwards, a Gaussian process regression model that maps the visual-audio feature space to the fMRI-derived feature space is trained on the training samples. The trained model is then used to predict fMRI-derived features for videos that have no fMRI data. Finally, multi-modal spectral clustering and multi-modal ranking algorithms integrate these two heterogeneous feature types for video clustering and retrieval, respectively. Our experiments on the TRECVID database demonstrate that integrating visual-audio features with fMRI-derived features can substantially improve the precision of video clustering and retrieval.
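
The regression step described in the abstract, mapping visual-audio features to fMRI-derived features so that videos without scans still receive an fMRI-style descriptor, can be illustrated with a small sketch. This is not the authors' implementation: it assumes NumPy, a squared-exponential (RBF) kernel, fixed hyperparameters, and synthetic random data. The function names (`rbf_kernel`, `gp_fit`, `gp_predict`) and the feature dimensions are hypothetical choices made only for illustration.

```python
import numpy as np


def rbf_kernel(A, B, length_scale=1.0):
    """Squared-exponential kernel between the rows of A and the rows of B."""
    sq_dists = (
        np.sum(A ** 2, axis=1)[:, None]
        + np.sum(B ** 2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return np.exp(-0.5 * np.maximum(sq_dists, 0.0) / length_scale ** 2)


def gp_fit(X_train, Y_train, length_scale=1.0, noise=1e-2):
    """Return alpha = (K + noise * I)^-1 Y via a Cholesky factorization."""
    K = rbf_kernel(X_train, X_train, length_scale)
    K[np.diag_indices_from(K)] += noise
    L = np.linalg.cholesky(K)
    return np.linalg.solve(L.T, np.linalg.solve(L, Y_train))


def gp_predict(X_train, alpha, X_new, length_scale=1.0):
    """Posterior mean of the GP at the rows of X_new."""
    return rbf_kernel(X_new, X_train, length_scale) @ alpha


# Synthetic stand-ins (assumed sizes): 40 shots, 5 visual-audio features,
# 3 fMRI-derived features per shot.
rng = np.random.default_rng(0)
visual_audio = rng.normal(size=(40, 5))
fmri_derived = np.sin(visual_audio @ rng.normal(size=(5, 3)))

# Pretend only the first 30 shots have fMRI-derived features available.
X_train, Y_train = visual_audio[:30], fmri_derived[:30]
X_new = visual_audio[30:]

alpha = gp_fit(X_train, Y_train)
predicted_fmri = gp_predict(X_train, alpha, X_new)
print(predicted_fmri.shape)  # (10, 3): one predicted vector per unseen shot
```

In a pipeline like the one the abstract outlines, the predicted vectors would then be combined with the visual-audio features in the clustering and ranking stages. Those stages are not sketched here because the abstract does not specify them.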

Publisher
Database: Elsevier - ScienceDirect
Journal: Neurocomputing - Volume 144, 20 November 2014, Pages 128–137