Advanced parallel combined Gaussian mixture model based feature compensation integrated with iterative channel estimation

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
565889	1452027	2015	13 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

channel estimation - برآورد کانال Model combination - ترکیب مدل Feature compensation - جبران مشخصات Robust speech recognition - شناسایی قوی سخنرانی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش صفحه اول مقاله

Advanced parallel combined Gaussian mixture model based feature compensation integrated with iterative channel estimation

چکیده انگلیسی

• Effective feature compensation for speech recognition in noise and channel distortion.
• Employs Parallel Combined Gaussian Mixture Model (PCGMM).
• Evaluation uses objective measures including STNR, PESQ, and speech recognition.
• Show +9.77% and +15.77% relative avg. WER improvement vs. ETSI AFE standard.

This study proposes an effective feature compensation scheme to address severely adverse environments for speech recognition where background noise and channel distortion are simultaneously involved. In the proposed scheme, an iterative channel estimation method is integrated into the framework of our previously proposed Parallel Combined Gaussian Mixture Model (PCGMM) based feature compensation algorithm. A new speech corpus is developed which reflects both additive and convolutional noise corruption. The channel distortion effects are obtained from the NTIMIT and CTIMIT corpora. Evaluation based on objective measures including STNR, PESQ, and speech recognition shows that generated speech corpus includes highly challenging acoustic conditions for speech recognition. The proposed feature compensation method is evaluated over the developed speech corpus. The experimental results demonstrate that the proposed feature compensation scheme is effective at improving speech recognition performance in the presence of both background noise and channel distortion, employing the iterative channel estimation method. The proposed PCGMM-based feature compensation scheme employing the channel estimation method shows +3.58% and +11.61% relative improvements in averaged WER compared to the ETSI AFE algorithm for the developed speech corpora including NTIMIT and CTIMIT channel effects respectively. For real-life application, a voice activity detection technique is employed to estimate the noise model for PCGMM-based method without a priori knowledge of the non-speech locations of input speech. The proposed method is also evaluated on the CU-Move corpus which represents actual in-vehicle conditions, showing a +12.99% relative improvement compared to the ETSI AFE. This study confirms that the proposed PCGMM-based feature compensation method integrated with channel estimation is effective at increasing speech recognition accuracy in real-life severely adverse conditions.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 73, October 2015, Pages 81–93

نویسندگان

Wooil Kim, John H.L. Hansen,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Advanced parallel combined Gaussian mixture model based feature compensation integrated with iterative channel estimation

دسترسی سریع

ارتباط

English Website