کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4977813 1452013 2017 33 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Regularized non-negative matrix factorization with Gaussian mixtures and masking model for speech enhancement
ترجمه فارسی عنوان
فاکتورسازی ماتریس غیر منفی منظم با مخلوط گاوسی و مدل پوشش برای تقویت گفتار
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی
We introduce single-channel supervised speech enhancement algorithms based on regularized non-negative matrix factorization (NMF). In the proposed framework, the log-likelihood functions (LLF) of the magnitude spectra for both the clean speech and noise, based on Gaussian mixture models (GMM), are included as regularization terms in the NMF cost function. By using this proposed regularization as a priori information in the enhancement stage, we can exploit the statistical properties of both the clean speech and noise signals. For further improvement of the enhanced speech quality, we also incorporate a masking model of the human auditory system in our approach. Specifically, we construct a weighted Wiener filter (WWF) where the power spectral densities (PSD) of the speech and noise are estimated from the above mentioned NMF algorithm with the proposed regularization. The weighting factor in the WWF is selected based on a masking threshold which is obtained from the estimated PSD of the enhanced speech. Experimental results of perceptual evaluation of speech quality (PESQ), source-to-distortion ratio (SDR) and segmental signal-to-noise ratio (SNR) show that the proposed speech enhancement algorithms (i.e., regularized NMF with and without masking model) provide better performance in speech enhancement than the benchmark algorithms.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 87, March 2017, Pages 18-30
نویسندگان
, , ,