Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
4947820 | Neurocomputing | 2017 | 16 Pages |
Abstract
In this paper, a speech enhancement method based on regularized non-negative matrix factorization (NMF) for non-stationary Gaussian noise is proposed. An iterative posterior NMF-based model of the magnitudes of the spectral components of speech and noise is implemented using prior distributions for the magnitudes in the transformed domain. Because their sample distributions fit gamma and Rayleigh densities well, we propose to adaptively estimate the statistics of these distributions that are sufficient to provide a natural regularization of the NMF criterion. The resulting method is shown to outperform other benchmark algorithms in terms of the signal-to-distortion ratio (SDR) and a perceptual evaluation of the speech quality (PESQ).
Related Topics
Physical Sciences and Engineering
Computer Science
Artificial Intelligence
Authors
Sunnydayal Sunnydayal, Kishore Kumar, Sergio Cruces,