کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
559059 875043 2012 19 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Joint reranking of parsing and word recognition with automatic segmentation
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
Joint reranking of parsing and word recognition with automatic segmentation
چکیده انگلیسی

Evaluation and optimization of automatic speech recognition (ASR) and parsing systems are often done separately. In the context of spoken language processing, however, these problems may be explored jointly via a reranking architecture. In this work, the effects of reranking for word error rate (WER) or reranking for the Sparseval parse-quality measure are examined in conversational speech recognition, while considering the impact of automatic segmentation. Under a WER criterion, the results indicate that the parse language model alone provides little benefit over a large n-gram model, but adding non-local syntactic features leads to improved performance. Under a Sparseval criterion, it is shown that including alternative word-sequence hypotheses has a much greater impact on parse accuracy than including alternate parse hypotheses. In both cases, the biggest performance improvements are obtained with high quality sentence segmentations. Qualitative analyses show that parse features help recover pronouns and improve recognition of main verbs.


► A new framework for joint reranking of parsing and word recognition incorporating automatic sentence segmentation.
► Confirms prior findings: segmentation quality has impact on parsing (Sparseval) quality; parsing language model has word error rate (WER) impact.
► Non-local syntactic features in reranking yield WER improvements vs. parsing LM alone; benefits larger when optimizing for WER (vs. parsing).
► More alternate word-hypotheses in reranking have a larger impact on parse accuracy than more parse hypotheses, and.
► Significant gains in parse accuracy are obtained through joint optimization.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 26, Issue 1, January 2012, Pages 1–19
نویسندگان
, ,