Free article download: Analysis of the Intrinsic Mode Functions for Speaker Information

Article code	Journal code	Publication year	English article	Full-text version
4977807	1452009	2017	16-page PDF	Free download

English title of the ISI article

Analysis of the Intrinsic Mode Functions for Speaker Information

Keywords

EMD; GMM; MFCCs; MEMD; IMFs; i-vector

Related topics

Engineering and Basic Sciences, Computer Engineering, Signal Processing

Abstract

This work explores the utility of the time-domain signal components, or Intrinsic Mode Functions (IMFs), of speech signals, as generated by the data-adaptive filterbank nature of Empirical Mode Decomposition (EMD), in characterizing speakers for the task of text-independent Speaker Verification (SV). A modified version of EMD, denoted MEMD, which extracts IMFs with less mode-mixing and provides a better representation of the higher-frequency spectrum of speech, is also used for the SV task. Three different features are extracted over 20 ms frames from the IMFs of EMD and MEMD. They are then tested individually, and in conjunction with the Mel Frequency Cepstral Coefficients (MFCCs), for SV. Two corpora, the NIST SRE 2003 corpus and the CHAINS corpus, are used for the experiments. The results on the NIST SRE 2003 database, using the i-vector framework, reveal that the features extracted from the IMFs, in conjunction with the MFCCs, enhance the performance of the SV system. Further, it is observed that only a small set of lower-order IMFs is useful and necessary for characterizing speaker-specific information. The combination of the features with the MFCCs is also found to be useful when short speech utterances of ≤10 s are used for testing. Similarly, the results on the CHAINS corpus, using the conventional Gaussian Mixture Model (GMM) framework, reveal that the features, in combination with the MFCCs, enhance the performance of the SV system, not only for normal speech, but also for fast and whispered speech. Again, it is observed that only the first few IMFs are needed and useful for achieving such enhanced performance.
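
The frame-level processing mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not name the three features, so the per-frame log energy below is only a stand-in, not the authors' method. The non-overlapping framing, the 8 kHz sample rate, and the helper names `frame_signal` and `framewise_log_energy` are likewise assumptions made for illustration. Two sinusoids take the place of real IMFs, which would come from an EMD or MEMD implementation.

```python
import numpy as np


def frame_signal(x, sample_rate, frame_ms=20.0):
    """Split a 1-D signal into non-overlapping frames of frame_ms milliseconds.

    Trailing samples that do not fill a whole frame are dropped.
    Non-overlapping frames are an assumption of this sketch.
    """
    frame_len = int(round(sample_rate * frame_ms / 1000.0))
    n_frames = len(x) // frame_len
    return np.reshape(x[: n_frames * frame_len], (n_frames, frame_len))


def framewise_log_energy(components, sample_rate, frame_ms=20.0, eps=1e-12):
    """Compute the log energy of each frame of each component.

    components: array of shape (n_components, n_samples), for example the
    IMFs of one utterance. Returns an array of shape (n_frames, n_components).
    Log energy is a hypothetical stand-in for the paper's unnamed features.
    """
    feats = []
    for component in components:
        frames = frame_signal(component, sample_rate, frame_ms)
        feats.append(np.log(np.sum(frames ** 2, axis=1) + eps))
    return np.stack(feats, axis=1)


# Stand-in components: two sinusoids instead of real IMFs.
sample_rate = 8000  # assumed sample rate, in Hz
t = np.arange(sample_rate) / sample_rate  # one second of samples
components = np.stack([
    np.sin(2 * np.pi * 1000 * t),
    0.5 * np.sin(2 * np.pi * 200 * t),
])
features = framewise_log_energy(components, sample_rate)
print(features.shape)
```

In a combined system of the kind the abstract describes, a per-frame matrix like `features` would be concatenated column-wise with the MFCCs of the same frames before speaker modeling; that step is not shown here.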

Publisher

Database: Elsevier - ScienceDirect
Journal: Speech Communication - Volume 91, July 2017, Pages 1-16

Authors

Rajib Sharma, S.R.M. Prasanna, Ramesh K. Bhukya, Rohan Kumar Das
