Article code: 563107
Journal code: 875471
Publication year: 2013
English article: 16 pages, PDF
Full-text version: Free download
English title of the ISI article
Speaker state recognition using an HMM-based feature extraction method
Related topics
Engineering and Basic Sciences, Computer Engineering, Signal Processing
English abstract

In this article we present an efficient approach to modeling acoustic features for recognizing various paralinguistic phenomena. In the standard scheme, the Universal Background Model (UBM) is represented by a Gaussian Mixture Model (GMM) of the frame-level acoustic features and is then adapted. Instead, we propose to represent the UBM by a monophone-based Hidden Markov Model (HMM). We present two approaches: transforming the monophone-based segmented HMM–UBM into a GMM–UBM and proceeding with the standard adaptation scheme, or performing the adaptation directly on the HMM–UBM. Both approaches give better results than the standard adaptation scheme (GMM–UBM) in both the emotion recognition task and the alcohol detection task. Furthermore, with the proposed method we were able to achieve better results than the current state-of-the-art systems in both tasks.
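For readers unfamiliar with the standard GMM–UBM adaptation scheme the abstract refers to, the sketch below illustrates one common form of it: mean-only relevance-MAP adaptation of a diagonal-covariance GMM. It is not the paper's HMM–UBM method, and the abstract does not state which adaptation variant or settings the authors used. The function name `map_adapt_means`, the relevance factor of 16, and the toy data are assumptions made for illustration only.

```python
import numpy as np

def map_adapt_means(frames, weights, means, variances, relevance=16.0):
    """Mean-only relevance-MAP adaptation of a diagonal-covariance GMM-UBM.

    frames: (T, D) feature vectors; weights: (K,); means, variances: (K, D).
    The relevance factor is an illustrative assumption, not a value from the paper.
    """
    # Log-density of every frame under every component, shape (T, K).
    diff = frames[:, None, :] - means[None, :, :]
    log_gauss = -0.5 * (np.sum(diff ** 2 / variances, axis=2)
                        + np.sum(np.log(2.0 * np.pi * variances), axis=1))
    log_post = np.log(weights) + log_gauss
    # Normalise in the log domain to get component posteriors per frame.
    log_post -= np.logaddexp.reduce(log_post, axis=1, keepdims=True)
    post = np.exp(log_post)

    n_k = post.sum(axis=0)                       # soft frame counts, (K,)
    data_mean = (post.T @ frames) / np.maximum(n_k, 1e-10)[:, None]
    alpha = (n_k / (n_k + relevance))[:, None]   # data-dependent mixing weight
    # Interpolate between the data mean and the UBM mean for each component.
    return alpha * data_mean + (1.0 - alpha) * means

# Toy UBM with two well-separated components (hypothetical values).
ubm_weights = np.array([0.5, 0.5])
ubm_means = np.array([[0.0, 0.0], [10.0, 10.0]])
ubm_vars = np.ones((2, 2))
# Sixteen identical frames close to the first component.
utterance = np.full((16, 2), 1.0)

adapted = map_adapt_means(utterance, ubm_weights, ubm_means, ubm_vars)
print(adapted)
```

In this toy case the first component receives essentially all sixteen frames, so with a relevance factor of 16 its mean moves halfway toward the data, while the second component, which sees almost no data, stays at its UBM value. In GMM–UBM systems the adapted means are often stacked into a single supervector and passed to a classifier; whether the paper does so is not stated in the abstract.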


► Efficient approach to modeling the acoustic features for speaker state recognition.
► The HMM–UBM–MAP adaptation scheme is presented and evaluated.
► Emotion recognition and alcohol detection tasks are used for assessment.
► Superior results are achieved in both tasks compared to other systems.

Publisher
Database: Elsevier - ScienceDirect
Journal: Computer Speech & Language - Volume 27, Issue 1, January 2013, Pages 135–150
Authors
, , ,