کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
380198 1437426 2016 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Multistage data selection-based unsupervised speaker adaptation for personalized speech emotion recognition
ترجمه فارسی عنوان
انطباق سخنران های ناخواسته با انتخاب چندگانه برای شناخت احساسات گفتاری شخصی
کلمات کلیدی
شناخت احساسات گفتاری، سازگاری بلندگو، حداکثر رگرسیون خطی احتمال، مدل پس زمینه جهانی مدل آکوستیک
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی

This paper proposes an efficient speech emotion recognition (SER) approach that utilizes personal voice data accumulated on personal devices. A representative weakness of conventional SER systems is the user-dependent performance induced by the speaker independent (SI) acoustic model framework. But, handheld communications devices such as smartphones provide a collection of individual voice data, thus providing suitable conditions for personalized SER that is more enhanced than the SI model framework. By taking advantage of personal devices, we propose an efficient personalized SER scheme employing maximum likelihood linear regression (MLLR), a representative speaker adaptation technique. To further advance the conventional MLLR technique for SER tasks, the proposed approach selects useful data that convey emotionally discriminative acoustic characteristics and uses only those data for adaptation. For reliable data selection, we conduct multistage selection using a log-likelihood distance-based measure and a universal background model. On SER experiments based on a Linguistic Data Consortium emotional speech corpus, our approach exhibited superior performance when compared to conventional adaptation techniques as well as the SI model framework.

Figure optionsDownload as PowerPoint slide

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Engineering Applications of Artificial Intelligence - Volume 52, June 2016, Pages 126–134
نویسندگان
, ,