A monotonic statistical machine translation approach to speaking style transformation

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
558437	874929	2012	22 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش صفحه اول مقاله

A monotonic statistical machine translation approach to speaking style transformation

چکیده انگلیسی

This paper presents a method for automatically transforming faithful transcripts or ASR results into clean transcripts for human consumption using a framework we label speaking style transformation (SST). We perform a detailed analysis of the types of corrections performed by human stenographers when creating clean transcripts, and propose a model that is able to handle the majority of the most common corrections. In particular, the proposed model uses a framework of monotonic statistical machine translation to perform not only the deletion of disfluencies and insertion of punctuation, but also correction of colloquial expressions, insertions of omitted words, and other transformations. We provide a detailed description of the model implementation in the weighted finite state transducer (WFST) framework. An evaluation of the proposed model on both faithful transcripts and speech recognition results of parliamentary and lecture speech demonstrates the effectiveness of the proposed model in performing the wide variety of corrections necessary for creating clean transcripts.

► We present a method for transforming faithful/ASR transcripts to clean transcripts.
► This method is called “speaking style transformation.”
► We perform an analysis of the corrections performed by human stenographers.
► Based on this, we propose a model that is able to handle the most common corrections.
► On parliamentary speech, the system is accurate across many types of transformations.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 26, Issue 5, October 2012, Pages 349–370

نویسندگان

Graham Neubig, Yuya Akita, Shinsuke Mori, Tatsuya Kawahara,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

A monotonic statistical machine translation approach to speaking style transformation

دسترسی سریع

ارتباط

English Website