Article ID Journal Published Year Pages File Type
535250 Pattern Recognition Letters 2007 7 Pages PDF
Abstract

The Mel-frequency cepstral coefficients (MFCC) are most widely used features for speech recognition. But, their performance degrades in presence of additive noise. In this paper, we propose a noise compensation method for Mel sub-bands energies as well as MFCC features. This method includes two steps: Mel sub-band spectral subtraction and compression of Mel sub-band energies. In the compression step, we propose a sub-band SNR-dependent compression function. This function replaces logarithm function in conventional MFCC feature extraction. Experimental results show that the proposed method significantly improves performance of MFCC features in noisy conditions. It decreases word error rate about 70% in SNR value of 0 dB for different types of additive noise.

Related Topics
Physical Sciences and Engineering Computer Science Computer Vision and Pattern Recognition
Authors
, ,