کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
558363 874908 2013 21 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Uncertainty-based learning of acoustic models from noisy data
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
Uncertainty-based learning of acoustic models from noisy data
چکیده انگلیسی

We consider the problem of acoustic modeling of noisy speech data, where the uncertainty over the data is given by a Gaussian distribution. While this uncertainty has been exploited at the decoding stage via uncertainty decoding, its usage at the training stage remains limited to static model adaptation. We introduce a new expectation maximization (EM) based technique, which we call uncertainty training, that allows us to train Gaussian mixture models (GMMs) or hidden Markov models (HMMs) directly from noisy data with dynamic uncertainty. We evaluate the potential of this technique for a GMM-based speaker recognition task on speech data corrupted by real-world domestic background noise, using a state-of-the-art signal enhancement technique and various uncertainty estimation techniques as a front-end. Compared to conventional training, the proposed training algorithm results in 3–4% absolute improvement in speaker recognition accuracy by training from either matched, unmatched or multi-condition noisy data. This algorithm is also applicable with minor modifications to maximum a posteriori (MAP) or maximum likelihood linear regression (MLLR) acoustic model adaptation from noisy data and to other data than audio.


► Train GMM or HMM acoustic models from noisy data with dynamic uncertainty.
► 3–4% absolute improvement in speaker recognition accuracy compared to conventional training on speech data corrupted by domestic background noise.
► Also applicable with minor modifications to MAP or MLLR acoustic model adaptation.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 27, Issue 3, May 2013, Pages 874–894
نویسندگان
, , ,