کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
567344 876070 2013 16 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Approximate inference: A sampling based modeling technique to capture complex dependencies in a language model
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
Approximate inference: A sampling based modeling technique to capture complex dependencies in a language model
چکیده انگلیسی

In this paper, we present strategies to incorporate long context information directly during the first pass decoding and also for the second pass lattice re-scoring in speech recognition systems. Long-span language models that capture complex syntactic and/or semantic information are seldom used in the first pass of large vocabulary continuous speech recognition systems due to the prohibitive increase in the size of the sentence-hypotheses search space. Typically, n-gram language models are used in the first pass to produce N-best lists, which are then re-scored using long-span models. Such a pipeline produces biased first pass output, resulting in sub-optimal performance during re-scoring. In this paper we show that computationally tractable variational approximations of the long-span and complex language models are a better choice than the standard n-gram model for the first pass decoding and also for lattice re-scoring.


► We approximate long-span language models (LM) using variational inference technique.
► Tractable surrogate models are then used in first pass ASR decoding.
► We work with recurrent neural network long span LMs.
► First pass and lattice rescoring experiments are carried out.
► Significant perplexity and WER reductions are reported on many speech tasks.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 55, Issue 1, January 2013, Pages 162–177
نویسندگان
, , , ,