A fast maximum likelihood nonlinear feature transformation method for GMM–HMM speaker adaptation

Article code	Journal code	Publication year	English article	Full-text version
408139	678250	2014	8-page PDF	Free download

English title of the ISI article

A fast maximum likelihood nonlinear feature transformation method for GMM–HMM speaker adaptation

Keywords

Speech recognition; Maximum likelihood; Extreme learning machine; Speaker adaptation; Neural networks; Hidden Markov models

Related subjects

Engineering and Basic Sciences; Computer Engineering; Artificial Intelligence

Abstract

We describe a novel maximum likelihood nonlinear feature bias compensation method for Gaussian mixture model–hidden Markov model (GMM–HMM) adaptation. Our approach exploits a single-hidden-layer neural network (SHLNN) that, similar to the extreme learning machine (ELM), uses randomly generated lower-layer weights and linear output units. Different from the conventional ELM, however, our approach optimizes the SHLNN parameters by maximizing the likelihood of observing the features given the speaker-independent GMM–HMM. We derive a novel and efficient learning algorithm for optimizing this criterion. We show, on a large vocabulary speech recognition task, that the proposed approach can cut the word error rate (WER) by 13% over the feature maximum likelihood linear regression (fMLLR) method with bias compensation, and can cut the WER by more than 5% over the fMLLR method with both bias and rotation transformations if applied on top of the fMLLR. Overall, it can reduce the WER by more than 27% over the speaker-independent system with 0.2 real-time adaptation time.
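The idea in the abstract, random fixed lower-layer weights with linear output weights trained for likelihood rather than least-squares targets, can be illustrated with a small sketch. This is not the authors' algorithm. It assumes a bias of the form y = x + U·h(x) with h(x) = sigmoid(Wx + c), replaces the speaker-independent GMM–HMM with a plain diagonal-covariance GMM, holds the component posteriors fixed within each iteration (an EM-style auxiliary function), ignores any Jacobian term of the transformation, and adds a small ridge term for numerical stability. All function and parameter names (`fit_bias`, `transform`, `n_hidden`, `ridge`) are hypothetical.

```python
import numpy as np

def _log_joint(y, w, mu, var):
    # (T, M) log of weight * diagonal-Gaussian density for each frame and component.
    diff = y[:, None, :] - mu[None, :, :]
    log_norm = np.sum(np.log(2.0 * np.pi * var), axis=1)
    return np.log(w) - 0.5 * (np.sum(diff ** 2 / var, axis=2) + log_norm)

def posteriors(y, w, mu, var):
    # Component posteriors gamma[t, m] under the fixed GMM.
    logp = _log_joint(y, w, mu, var)
    p = np.exp(logp - logp.max(axis=1, keepdims=True))
    return p / p.sum(axis=1, keepdims=True)

def avg_loglik(y, w, mu, var):
    # Average per-frame log-likelihood of the features under the GMM.
    logp = _log_joint(y, w, mu, var)
    m = logp.max(axis=1, keepdims=True)
    return float(np.mean(m[:, 0] + np.log(np.exp(logp - m).sum(axis=1))))

def transform(x, W, c, U):
    # Assumed form: feature plus a nonlinear bias from a single hidden layer.
    h = 1.0 / (1.0 + np.exp(-(x @ W + c)))
    return x + h @ U

def fit_bias(x, w, mu, var, n_hidden=32, n_iter=5, ridge=1e-3, seed=0):
    rng = np.random.default_rng(seed)
    T, D = x.shape
    # Lower-layer weights are drawn at random and never updated (ELM-style).
    W = rng.normal(scale=1.0 / np.sqrt(D), size=(D, n_hidden))
    c = rng.normal(size=n_hidden)
    h = 1.0 / (1.0 + np.exp(-(x @ W + c)))
    U = np.zeros((n_hidden, D))  # start from the identity transform (zero bias)
    for _ in range(n_iter):
        gamma = posteriors(x + h @ U, w, mu, var)   # E-step with current U
        a = gamma @ (1.0 / var)                     # (T, D) precision weights
        b = gamma @ (mu / var) - a * x              # (T, D) weighted residual targets
        for d in range(D):
            # Closed-form weighted ridge solution for output dimension d.
            A = (h * a[:, d:d + 1]).T @ h + ridge * np.eye(n_hidden)
            U[:, d] = np.linalg.solve(A, h.T @ b[:, d])
    return W, c, U
```

With diagonal covariances and fixed posteriors, each output dimension reduces to an independent weighted least-squares problem, so only the output weights `U` need to be solved for. That structure is the reason this kind of update can be fast; the sketch makes no claim about matching the reported WER or adaptation-time figures.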

Publisher

Database: Elsevier - ScienceDirect
Journal: Neurocomputing - Volume 128, 27 March 2014, Pages 145–152

Authors

Kaisheng Yao, Dong Yu, Li Deng, Yifan Gong
