Mask estimation and imputation methods for missing data speech recognition in a multisource reverberant environment

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
558359	874908	2013	22 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Speech recognition - تشخیص گفتار Missing data - داده های گم شده Binaural - دوطرفه Imputation - محاسبه Noise robust - نویز قوی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش صفحه اول مقاله

Mask estimation and imputation methods for missing data speech recognition in a multisource reverberant environment

چکیده انگلیسی

We present an automatic speech recognition system that uses a missing data approach to compensate for challenging environmental noise containing both additive and convolutive components. The unreliable and noise-corrupted (“missing”) components are identified using a Gaussian mixture model (GMM) classifier based on a diverse range of acoustic features. To perform speech recognition using the partially observed data, the missing components are substituted with clean speech estimates computed using both sparse imputation and cluster-based GMM imputation. Compared to two reference mask estimation techniques based on interaural level and time difference-pairs, the proposed missing data approach significantly improved the keyword accuracy rates in all signal-to-noise ratio conditions when evaluated on the CHiME reverberant multisource environment corpus. Of the imputation methods, cluster-based imputation was found to outperform sparse imputation. The highest keyword accuracy was achieved when the system was trained on imputed data, which made it more robust to possible imputation errors.

► Multifeature based mask estimation classifier is applied to missing data ASR.
► Mask estimation based on multifeature set outperforms binaural reference masks.
► Cluster-based imputation outperforms sparse imputation.
► Feature discrimination power analysis is conducted.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 27, Issue 3, May 2013, Pages 798–819

نویسندگان

Sami Keronen, Heikki Kallasjoki, Ulpu Remes, Guy J. Brown, Jort F. Gemmeke, Kalle J. Palomäki,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Mask estimation and imputation methods for missing data speech recognition in a multisource reverberant environment

دسترسی سریع

ارتباط

English Website