Article ID | Journal ID | Publication Year | English Article | Full-Text Version |
---|---|---|---|---|
565971 | 875886 | 2011 | 14-page PDF | Free download |

This study proposes a novel model composition method to improve speech recognition performance in time-varying background noise conditions. It is suggested that each element of the cepstral coefficient vector corresponds to a different rate of variation in the envelope of the log-spectrum. Motivated by this observation, the proposed method formulates variational noise models by selectively applying perturbation factors to the mean parameters of a basis model, yielding a collection of noise models that better covers the natural range of spectral patterns observed in the log-spectral domain. The basis noise model is obtained from the silence segments of the input speech, and the perturbation factors are designed separately for changes in the energy level and in the spectral envelope. The proposed variational model composition (VMC) method is employed to generate multiple environmental models for our previously proposed parallel combined Gaussian mixture model (PCGMM) based feature compensation algorithm. A mixture sharing technique is integrated to reduce the computational expense incurred by employing the variational models. Experimental results show that the proposed method is considerably more effective at improving speech recognition performance in time-varying background noise conditions, with +31.31%, +10.65%, and +20.54% average relative improvements in word error rate for speech babble, background music, and real-life in-vehicle noise conditions, respectively, compared to the original basic PCGMM method.
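To make the composition step concrete, the sketch below illustrates the general idea of perturbing the mean parameters of a basis noise model to obtain a set of variational noise models. The function name `compose_variational_models`, the use of an additive offset for the energy term (c0) and a multiplicative factor for the envelope terms (c1 and above), and the small grid of perturbation factors are all illustrative assumptions; the paper's exact perturbation design is not reproduced here.

```python
# Minimal sketch of the variational model composition (VMC) idea:
# perturb the mean parameters of a basis noise GMM to obtain a set of
# variational noise models. The perturbation scheme below is an assumption
# for illustration, not the paper's exact formulation.
import numpy as np
from itertools import product

def compose_variational_models(basis_means, energy_factors, envelope_factors):
    """basis_means: (n_mix, n_cep) mean vectors of the basis noise model,
    estimated from the silence segments of the input utterance.
    Returns one perturbed mean matrix per variational noise model."""
    models = []
    for e_fac, env_fac in product(energy_factors, envelope_factors):
        means = basis_means.copy()
        means[:, 0] += e_fac             # perturb the energy-level component (c0)
        means[:, 1:] *= (1.0 + env_fac)  # perturb the spectral-envelope components
        models.append(means)
    return models

# Example: a 4-mixture, 13-dimensional cepstral basis model and a hypothetical
# 3x3 grid of perturbation factors -> 9 variational noise models.
basis = np.random.randn(4, 13)
variational = compose_variational_models(
    basis, energy_factors=[-1.0, 0.0, 1.0], envelope_factors=[-0.1, 0.0, 0.1])
print(len(variational))  # 9
```

In the paper, the resulting variational models supply the multiple environmental models consumed by the PCGMM-based feature compensation stage; the mixture sharing technique then limits the added computation.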
Research highlights
► Variational model composition (VMC) method is proposed to address unseen noise.
► Variational models are generated by applying perturbation factors to a basis model.
► VMC method is employed to obtain multiple noise models for feature compensation.
► The proposed method is evaluated in various types of background noise conditions.
► The proposed method shows significant improvement in speech recognition performance.
Journal: Speech Communication - Volume 53, Issue 4, April 2011, Pages 451–464