Boosting systems for large vocabulary continuous speech recognition

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
567497	876090	2012	7 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Speech recognition - تشخیص گفتار Boosting - تقویت Acoustic modeling - مدل سازی آکوستیک

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش صفحه اول مقاله

Boosting systems for large vocabulary continuous speech recognition

چکیده انگلیسی

We employ a variant of the popular Adaboost algorithm to train multiple acoustic models such that the aggregate system exhibits improved performance over the individual recognizers. Each model is trained sequentially on re-weighted versions of the training data. At each iteration, the weights are decreased for the frames that are correctly decoded by the current system. These weights are then multiplied with the frame-level statistics for the decision trees and Gaussian mixture components of the next iteration system. The composite system uses a log-linear combination of HMM state observation likelihoods. We report experimental results on several broadcast news transcription setups which differ in the language being spoken (English and Arabic) and amounts of training data. Additionally, we study the impact of boosting on maximum likelihood (ML) and discriminatively trained acoustic models. Our findings suggest that significant gains can be obtained for small amounts of training data even after feature and model-space discriminative training.

► We apply the Adaboost algorithm to large vocabulary continuous speech recognition.
► Acoustic models are trained sequentially on re-weighted data.
► Phonetic decision trees are also included in the boosting procedure.
► We study the impact of boosting for ML and discriminatively trained models.
► Accuracy gains on English and Arabic broadcast news transcription are obtained.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 54, Issue 2, February 2012, Pages 212–218

نویسندگان

George Saon, Hagen Soltau,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Boosting systems for large vocabulary continuous speech recognition

دسترسی سریع

ارتباط

English Website