Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
6960439 | Speech Communication | 2018 | 14 Pages | |
Abstract
We propose a novel estimator of the amplitude of speech coefficients in the time-frequency domain. To avoid having to estimate the phase spectrum of the complex coefficients produced by the Fourier transform, we work with the discrete cosine transform (DCT), whose coefficients are real. The estimator aims to minimize the mean square error of the absolute values of the speech DCT coefficients. To take advantage of both parametric and non-parametric approaches, the proposed method combines block shrinkage with Bayesian statistical estimation. First, the absolute value of the clean coefficient is estimated by block smoothed sigmoid-based shrinkage (Block-SSBS); the block size required by Block-SSBS is obtained by statistical optimization. Like smoothed binary masking, this step reduces the negative impact that classical denoising methods have on speech intelligibility. Second, to refine the estimate, an optimal statistical estimator is added to handle musical noise. The performance of the proposed method is evaluated with objective criteria. The experiments confirm the relevance of the approach in terms of speech quality and intelligibility.
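As a rough illustration of the first stage only, the sketch below applies a smooth sigmoid gain to blocks of DCT coefficient amplitudes. It is not the authors' Block-SSBS: the gain function, the fixed block size, the known noise level `sigma`, and the parameters `slope` and `thresh_factor` are all assumptions chosen for the example, whereas the paper obtains the block size by statistical optimization and adds a second Bayesian stage that is not shown here.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix of size n x n (rows are basis vectors)."""
    k = np.arange(n)[:, None]
    m = np.arange(n)[None, :]
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * m + 1) * k / (2 * n))
    c[0, :] /= np.sqrt(2.0)
    return c

def sigmoid_gain(x, threshold, slope):
    """Smooth gain in (0, 1): close to 0 below the threshold, close to 1 above."""
    return 1.0 / (1.0 + np.exp(-slope * (x - threshold)))

def block_sigmoid_shrink(frame, sigma, block=4, slope=5.0, thresh_factor=2.0):
    """Shrink DCT amplitudes of one frame block by block (illustrative only).

    sigma, block, slope and thresh_factor are hypothetical parameters,
    not values taken from the paper.
    """
    n = frame.size
    C = dct_matrix(n)
    coeffs = C @ frame                 # real-valued DCT coefficients
    amp = np.abs(coeffs)
    est = np.empty_like(amp)
    for start in range(0, n, block):
        sl = slice(start, start + block)
        # RMS amplitude of the block, normalized by the noise level
        block_level = np.sqrt(np.mean(amp[sl] ** 2)) / sigma
        est[sl] = sigmoid_gain(block_level, thresh_factor, slope) * amp[sl]
    # Keep the sign of the noisy coefficients and invert the transform
    denoised = C.T @ (np.sign(coeffs) * est)
    return est, denoised
```

Because the DCT is real, only a sign has to be carried over from the noisy coefficients, which is the reason the abstract gives for preferring it to the Fourier transform. Sharing one gain across a block is one simple way to smooth the decision over neighbouring coefficients.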
Related Topics
Physical Sciences and Engineering
Computer Science
Signal Processing
Authors
Van-Khanh Mai, Dominique Pastor, Abdeldjalil Aïssa-El-Bey, Raphaël Le Bidan