Speech enhancement using a wavelet thresholding method based on symmetric Kullback-Leibler divergence

Article ID	Journal	Published Year	Pages	File Type
6959876	Signal Processing	2015	14 Pages	PDF

Abstract

Performance of wavelet thresholding methods for speech enhancement strongly depends on estimating an exact threshold value in the wavelet sub-bands. In this paper, we propose a new method for more exact estimation of the threshold value. Our proposed threshold value is firstly obtained based on the symmetric Kullback-Leibler divergence between the probability distributions of noisy speech and noise wavelet coefficients. In the next step, we improved this value using the segmental Signal-to-Noise Ratio (SNR). We used some TIMIT utterances to assess the performance of the proposed threshold. The algorithm is evaluated using the Perceptual Evaluation of Speech Quality (PESQ) score and the SNR improvement in ideal and real modes. In ideal and real modes, on average, we obtain respectively 2.25Â dB and 1Â dB SNR improvement and a PESQ score increase up to 1.1, 0.75 compared with the conventional wavelet thresholding approaches. In comparison to the adaptive thresholding approach, on average in ideal and real modes, we obtain respectively 1.6Â dB and 0.9Â dB SNR improvement. The PESQ value of the adaptive thresholding method, in the real and ideal modes, is 0.25 higher and 0.5 lower than that of our proposed method, respectively.

Keywords

Wavelet thresholding Kullback–Leibler divergence speech enhancement Probability distribution