کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
567557 876105 2011 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A temporal warped 2D psychoacoustic modeling for robust speech recognition system
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
A temporal warped 2D psychoacoustic modeling for robust speech recognition system
چکیده انگلیسی

Human auditory system performs better than speech recognition system under noisy condition, which leads us to the idea of incorporating the human auditory system into automatic speech recognition engines. In this paper, a hybrid feature extraction method, which utilizes forward masking, backward masking, and lateral inhibition, is incorporated into mel-frequency cepstral coefficients (MFCC). The integration is implemented using a warped 2D psychoacoustic filter. The AURORA2 database is utilized for testing, and the Hidden Markov Model (HMM) is used for recognition. Comparison is made against lateral inhibition (LI), forward masking (FM), cepstral mean and variance normalization (CMVN), the original 2D psychoacoustic filter and the RASTA filter. Experimental results show that the word recognition rate is significantly improved, especially under noisy conditions.

In this paper, a hybrid feature extraction method, which utilizes forward masking, backward masking, and lateral inhibition, is incorporated into mel-frequency cepstral coefficients (MFCC).Figure optionsDownload as PowerPoint slideResearch highlights
► Implementation of forward masking and lateral inhibition with a 2D filter.
► Mathematical derivation is provided to show the validity of the proposed 2D filter.
► Extensive comparison is made to show the superiority of the proposed algorithm.
► Experimental results show significant improvements.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 53, Issue 2, February 2011, Pages 229–241
نویسندگان
, ,