An improved model of masking effects for robust speech recognition system

Article code	Journal code	Publication year	English article	Full-text version
565934	875866	2013	10-page PDF	Free download

Keywords

AURORA2; Automatic speech recognition; Temporal masking; Simultaneous masking; Auditory modeling

Related topics

Engineering and Basic Sciences; Computer Engineering; Signal Processing

Abstract

The performance of automatic speech recognition systems drops dramatically in the presence of background noise, whereas the human auditory system copes far better with noisy speech. This paper proposes a novel auditory modeling algorithm, named LTFC, that is integrated into the feature extraction front-end of a Hidden Markov Model (HMM) recognizer. LTFC simulates properties of the human auditory system and applies them to the speech recognition system to improve its robustness. It integrates simultaneous masking, temporal masking, and cepstral mean and variance normalization into the ordinary mel-frequency cepstral coefficient (MFCC) feature extraction algorithm. The proposed method sharpens the power spectrum of the signal in both the frequency domain and the time domain. Evaluation tests are carried out on the AURORA2 database. Experimental results show that the proposed feature extraction method increases the word recognition rate.

► Modeling of human auditory system.
► Direct implementation of masking effects.
► Mathematical derivation is provided to show the correctness of LTFC.
► Extensive comparison is made to show the superiority of the proposed algorithm.
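
Of the components named in the abstract, cepstral mean and variance normalization is a standard technique that can be illustrated without access to the full paper. The following is a minimal sketch of per-utterance normalization, assuming the features arrive as a list of frames, each a list of cepstral coefficients. The function name `cmvn` and the `eps` guard are illustrative assumptions, not taken from the paper, and the sketch does not reproduce the LTFC masking stages.

```python
import math


def cmvn(features, eps=1e-10):
    """Per-utterance cepstral mean and variance normalization (sketch).

    features: list of frames, each a list of cepstral coefficients.
    Returns new frames in which each coefficient track has zero mean
    and approximately unit variance across the utterance.
    """
    if not features:
        return []
    n_frames = len(features)
    n_coeffs = len(features[0])
    # Mean of each coefficient over all frames of the utterance.
    means = [
        sum(frame[k] for frame in features) / n_frames
        for k in range(n_coeffs)
    ]
    # Population standard deviation of each coefficient over all frames.
    stds = [
        math.sqrt(
            sum((frame[k] - means[k]) ** 2 for frame in features) / n_frames
        )
        for k in range(n_coeffs)
    ]
    # eps (an assumption of this sketch) avoids division by zero
    # for coefficients that are constant across the utterance.
    return [
        [(frame[k] - means[k]) / (stds[k] + eps) for k in range(n_coeffs)]
        for frame in features
    ]
```

Normalizing each coefficient track in this way removes a constant offset and rescales the dynamic range, which is one common way to reduce the mismatch between clean training data and noisy test data.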

Publisher

Database: Elsevier - ScienceDirect
Journal: Speech Communication - Volume 55, Issue 3, March 2013, Pages 387–396

Authors

Peng Dai, Ing Yann Soon
