دانلود رایگان مقاله: مدل سازی وضوح گفتار با پوشش بهبود از محرک زمانی سازه

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
566670	1452019	2016	9 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Modeling speech intelligibility with recovered envelope from temporal fine structure stimulus

ترجمه فارسی عنوان

مدل سازی وضوح گفتار با پوشش بهبود از محرک زمانی سازه

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

وضوح سخن; سازه زمانی; پوشش بهبود; اندازه گیری عادی کوواریانس

temporal fine structure - ساختار خوب زمانی Speech intelligibility - هوش مصنوعی سخنرانی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال

پیش نمایش مقاله

مدل سازی وضوح گفتار با پوشش بهبود از محرک زمانی سازه

چکیده انگلیسی

• Most existing STI-based metrics use the temporal envelope information and discard the temporal fine structure cue to predict speech intelligibility.
• The TFS stimulus contains rich intelligibility information, which is attributed to the recovered envelope.
• The recovered envelope predicted the intelligibility as well as the original envelope did.
• The prediction power was significantly improved when these two envelope waveforms were integrated.

Temporal envelope and fine structure are two prominent acoustic cues for speech perception. Most existing speech-transmission-index-based metrics make use of the temporal envelope information and discard the temporal fine structure (TFS) cue to predict speech intelligibility. Recent studies have shown that the TFS stimulus synthesized with multiband TFS waveforms contains rich intelligibility information, which is reflected as the recovered envelope from the TFS stimulus. The present study first assessed the performance of using the recovered envelope from the synthesized TFS stimulus to predict the intelligibility of noise-distorted and noise-suppressed speech. The TFS stimulus was synthesized and fed as an input into the conventional normalized covariance measure (NCM) module. The results showed that the recovered envelope from the TFS stimulus predicted the intelligibility as well as the original envelope extracted from the wideband speech signal did. In addition, an additive intelligibility model was designed to combine the envelope from wideband speech and the recovered envelope from the TFS stimulus to predict speech intelligibility. The prediction power was significantly improved when these two envelope waveforms were integrated. The present study suggests that the recovered envelope from the TFS stimulus may be alternative acoustic information for modeling speech intelligibility and improving the prediction power of the conventional NCM-based intelligibility index.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 81, July 2016, Pages 120–128

نویسندگان

Fei Chen, Yu Tsao, Ying-Hui Lai,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : مدل سازی وضوح گفتار با پوشش بهبود از محرک زمانی سازه

دسترسی سریع

ارتباط

English Website