A constrained sequential EM algorithm for speech enhancement

Article ID	Journal	Published Year	Pages	File Type
10326539	Neural Networks	2008	9 Pages	PDF

Abstract

Speech enhancement is a fundamental problem, the goal of which is to estimate clean speech st, given a noise-contaminated signal st+nt, where nt is white or colored noise. This task can be viewed as a probabilistic inference problem which involves estimating the posterior distribution of hidden clean speech, given a noisy observation. Kalman filter is a representative method but is restricted to Gaussian distributions only. We consider the generalized auto-regressive (GAR) model in order to capture the non-Gaussian characteristics of speech. Then we present a constrained sequential EM algorithm where Rao-Blackwellized particle filters (RBPFs) are used in the E-step and model parameters are updated in a sequential manner in the M-step under positivity constraints for noise variance parameters. Numerical experiments confirm the high performance of our proposed method, compared to Kalman filter-based methods, in the task of sequential speech enhancement.

Keywords

Rao-Blackwellized particle filter Expectation Maximization (EM)speech enhancement Kalman filter