کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
393216 665578 2015 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Analysis of music/speech via integration of audio content and functional brain response
ترجمه فارسی عنوان
تجزیه و تحلیل موسیقی / سخنرانی از طریق ادغام محتوای صوتی و عملکرد مغز پاسخ
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی


• A novel music/speech analysis framework via integration of audio content and functional brain response.
• Brain response features are derived from fMRI data when participants are listening to the music/speech.
• A novel regression algorithm named ITGP is proposed to improve the quality in predicting functional brain response.

Effective analysis of music/speech data such as clustering, retrieval, and classification has received significant attention in recent years. Traditional methods mainly rely on the low-level acoustic features derived from digital audio stream, and the accuracy of these methods is limited by the well-known semantic gap. To alleviate this problem, we propose a novel framework for music/speech clustering, retrieval, and classification by integrating the low-level acoustic features derived from audio content with the functional magnetic resonance imaging (fMRI) measured features that represent the brain’s functional response when subjects are listening to the music/speech excerpts. First, the brain networks and regions of interest (ROIs) involved in the comprehension of audio stimuli, such as the auditory, emotion, attention, and working memory systems, are located by a new approach named dense individualized and common connectivity-based cortical landmarks (DICCCOLs). Then the functional connectivity matrix measuring the similarity between the fMRI signals of different ROIs is adopted to represent the brain’s comprehension of audio semantics. Afterwards, we propose an improved twin Gaussian process (ITGP) model based on self-training to predict the fMRI-measured features of testing data without fMRI scanning. Finally, multi-view learning algorithms are proposed to integrate acoustic features with fMRI-measured features for music/speech clustering, retrieval, and classification, respectively. The experimental results demonstrate the superiority of our proposed work in comparison with existing methods and suggest the advantage of integrating functional brain responses via fMRI data for music/speech analysis.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 297, 10 March 2015, Pages 271–282
نویسندگان
, , , , , , , ,