کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
565284 1452035 2014 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
An improved speech transmission index for intelligibility prediction
ترجمه فارسی عنوان
شاخص بهبود گفتار برای پیش بینی قابلیت اطمینان
کلمات کلیدی
شاخص انتقال سخنرانی، تابع انتقال مدولاسیون، تقویت گفتار، ارزیابی هدف، هوش مصنوعی سخنرانی، طیف مدولاسیون کوتاه مدت
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی


• Objective measure of intelligibility based on speech intelligibility index.
• Does not assume stationarity of modulation signal over entire utterance.
• Processes modulation envelope in shorttime segments.
• Use of shorter durations significantly improves performance compared to STI.
• Improved correlation to subjective intelligibility for stimuli processed by enhancement methods.

The speech transmission index (STI) is a well known measure of intelligibility, most suited to the evaluation of speech intelligibility in rooms, with stimuli subjected to additive noise and reverberance. However, STI and its many variations do not effectively represent the intelligibility of stimuli containing non-linear distortions such as those resulting from processing by enhancement algorithms. In this paper, we revisit the STI approach and propose a variation which processes the modulation envelope in short-time segments, requiring only an assumption of quasi-stationarity (rather than the stationarity assumption of STI) of the modulation signal. Results presented in this work show that the proposed approach improves the measures correlation to subjective intelligibility scores compared to traditional STI for a range of noise types and subjected to different enhancement approaches. The approach is also shown to have higher correlation than other coherence, correlation and distance measures tested, but is unsuited to the evaluation of stimuli heavily distorted with (for example) masking based processing, where an alternative approach such as STOI is recommended.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volume 65, November–December 2014, Pages 9–19
نویسندگان
, ,