Incorporating local information of the acoustic environments to MAP-based feature compensation and acoustic model adaptation

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
10368503	874801	2014	18 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Noise robustness - استحکام سر و صدا Local information - اطلاعات محلی Feature compensation - جبران مشخصات Acoustic model adaptation - سازگاری مدل آکوستیک map - نقشه

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش صفحه اول مقاله

Incorporating local information of the acoustic environments to MAP-based feature compensation and acoustic model adaptation

چکیده انگلیسی

The maximum a posteriori (MAP) criterion is popularly used for feature compensation (FC) and acoustic model adaptation (MA) to reduce the mismatch between training and testing data sets. MAP-based FC and MA require prior densities of mapping function parameters, and designing suitable prior densities plays an important role in obtaining satisfactory performance. In this paper, we propose to use an environment structuring framework to provide suitable prior densities for facilitating MAP-based FC and MA for robust speech recognition. The framework is constructed in a two-stage hierarchical tree structure using environment clustering and partitioning processes. The constructed framework is highly capable of characterizing local information about complex speaker and speaking acoustic conditions. The local information is utilized to specify hyper-parameters in prior densities, which are then used in MAP-based FC and MA to handle the mismatch issue. We evaluated the proposed framework on Aurora-2, a connected digit recognition task, and Aurora-4, a large vocabulary continuous speech recognition (LVCSR) task. On both tasks, experimental results showed that with the prepared environment structuring framework, we could obtain suitable prior densities for enhancing the performance of MAP-based FC and MA.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 28, Issue 3, May 2014, Pages 709-726

نویسندگان

Yu Tsao, Xugang Lu, Paul Dixon, Ting-yao Hu, Shigeki Matsuda, Chiori Hori,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Incorporating local information of the acoustic environments to MAP-based feature compensation and acoustic model adaptation

دسترسی سریع

ارتباط

English Website