کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
10429192 909699 2005 4 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Stream Weight Training Based on MCE for Audio-Visual LVCSR
موضوعات مرتبط
مهندسی و علوم پایه سایر رشته های مهندسی مهندسی (عمومی)
پیش نمایش صفحه اول مقاله
Stream Weight Training Based on MCE for Audio-Visual LVCSR
چکیده انگلیسی
In this paper we address the problem of audio-visual speech recognition in the framework of the multi-stream hidden Markov model. Stream weight training based on minimum classification error criterion is discussed for use in large vocabulary continuous speech recognition (LVCSR). We present the lattice re-scoring and Viterbi approaches for calculating the loss function of continuous speech. The experimental results show that in the case of clean audio, the system performance can be improved by 36.1% in relative word error rate reduction when using state-based stream weights trained by a Viterbi approach, compared to an audio only speech recognition system. Further experimental results demonstrate that our audio-visual LVCSR system provides significant enhancement of robustness in noisy environments.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Tsinghua Science & Technology - Volume 10, Issue 2, April 2005, Pages 141-144
نویسندگان
, ,