Prosodic and temporal features for language modeling for dialog

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
567493	876090	2012	14 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

Speech recognition - تشخیص گفتار Perplexity - ناراحتی Prosody - پرونده Prediction - پیش بینی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش صفحه اول مقاله

Prosodic and temporal features for language modeling for dialog

چکیده انگلیسی

If we can model the cognitive and communicative processes underlying speech, we should be able to better predict what a speaker will do. With this idea as inspiration, we examine a number of prosodic and timing features as potential sources of information on what words the speaker is likely to say next. In spontaneous dialog we find that word probabilities do vary with such features. Using perplexity as the metric, the most informative of these included recent speaking rate, volume, and pitch, and time until end of utterance. Using simple combinations of such features to augment trigram language models gave up to a 8.4% perplexity benefit on the Switchboard corpus, and up to a 1.0% relative reduction in word error rate (0.3% absolute) on the Verbmobil II corpus.

► Speakers’ underlying cognitive processes and states may be revealed by prosody.
► Features of the local prosodic context can help predict what words are likely next.
► Speaking rate, volume, pitch and time-until-utterance-end features were informative.
► A 8.4% perplexity reduction on the Switchboard corpus was obtained.
► In a recognizer, this gave up to a 1.0% relative reduction in word error rate.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 54, Issue 2, February 2012, Pages 161–174

نویسندگان

Nigel G. Ward, Alejandro Vega, Timo Baumann,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Prosodic and temporal features for language modeling for dialog

دسترسی سریع

ارتباط

English Website