Stream Weight Training Based on MCE for Audio-Visual LVCSR

Article code	Journal code	Year	English article	Full text
10429192	909699	2005	4-page PDF	Free download

Keywords

Discriminative training

Related topics

Engineering and Basic Sciences > Other Engineering Disciplines > Engineering (General)

Abstract

In this paper we address the problem of audio-visual speech recognition in the framework of the multi-stream hidden Markov model. Stream weight training based on the minimum classification error (MCE) criterion is discussed for use in large vocabulary continuous speech recognition (LVCSR). We present lattice re-scoring and Viterbi approaches for calculating the loss function of continuous speech. The experimental results show that, for clean audio, state-based stream weights trained by the Viterbi approach give a 36.1% relative reduction in word error rate compared to an audio-only speech recognition system. Further experimental results demonstrate that our audio-visual LVCSR system provides a significant enhancement of robustness in noisy environments.
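The abstract does not give the paper's equations, so the following is only a generic sketch of how stream weights and an MCE-style loss are commonly combined, not the authors' formulation. It assumes two streams whose weights sum to 1, a single competing hypothesis, and a sigmoid loss; the function names, the learning rate `lr`, and the slope `gamma` are hypothetical and chosen for illustration.

```python
import math

def combined_log_likelihood(log_audio, log_visual, audio_weight):
    """Weighted sum of per-stream log-likelihoods (weights assumed to sum to 1)."""
    return audio_weight * log_audio + (1.0 - audio_weight) * log_visual

def mce_loss(score_correct, score_competitor, gamma=1.0):
    """Sigmoid of the misclassification measure d = g_competitor - g_correct."""
    d = score_competitor - score_correct
    return 1.0 / (1.0 + math.exp(-gamma * d))

def mce_weight_step(correct, competitor, audio_weight, lr=0.1, gamma=1.0):
    """One gradient-descent step on the audio stream weight.

    correct, competitor: (log_audio, log_visual) score pairs for the
    reference path and one competing path (illustrative inputs).
    Returns the updated weight and the loss before the update.
    """
    g_c = combined_log_likelihood(correct[0], correct[1], audio_weight)
    g_k = combined_log_likelihood(competitor[0], competitor[1], audio_weight)
    loss = mce_loss(g_c, g_k, gamma)
    # Derivative of d with respect to the audio weight.
    dd_dw = (competitor[0] - competitor[1]) - (correct[0] - correct[1])
    # Chain rule through the sigmoid: loss * (1 - loss) * gamma * dd_dw.
    grad = gamma * loss * (1.0 - loss) * dd_dw
    # Keep the weight inside [0, 1] so both stream weights stay valid.
    new_weight = min(1.0, max(0.0, audio_weight - lr * grad))
    return new_weight, loss
```

In this toy setup, if the audio stream favours the reference path and the visual stream favours the competitor, a step moves the audio weight upward and the loss falls. The state-based weights and the lattice re-scoring and Viterbi loss computations mentioned in the abstract would replace the single pair of path scores used here.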

Publisher

Database: Elsevier - ScienceDirect
Journal: Tsinghua Science & Technology - Volume 10, Issue 2, April 2005, Pages 141-144

Authors

Liu (刘鹏), Wang (王作英)
