کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
557733 1451690 2016 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Computing scores of voice quality and speech intelligibility in tracheoesophageal speech for speech stimuli of varying lengths
ترجمه فارسی عنوان
محاسبه امتیازات از کیفیت صدا و وضوح گفتار در سخنرانی tracheoesophageal برای محرک های سخنرانی با طول های مختلف
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
چکیده انگلیسی


• Study investigates speech length and phonetic variety on model performance.
• Introduce strategy comparing models with restricted and non-restricted feature inputs.
• No statistical difference in model performance when feature inputs are restricted.
• Speech material circa 100 syllables allows accurate and stable model evaluation.

In this paper, automatic assessment models are developed for two perceptual variables: speech intelligibility and voice quality. The models are developed and tested on a corpus of Dutch tracheoesophageal (TE) speakers. In this corpus, each speaker read a text passage of approximately 300 syllables and two speech therapists provided consensus scores for the two perceptual variables. Model accuracy and stability are investigated as a function of the amount of speech that is made available for speaker assessment (clinical setting). Five sets of automatically generated acoustic-phonetic speaker features are employed as model inputs. In Part I, models taking complete feature sets as inputs are compared to models taking only the features which are expected to have sufficient support in the speech available for assessment. In Part II, the impact of phonetic content and stimulus length on the computer-generated scores is investigated. Our general finding is that a text encompassing circa 100 syllables is long enough to achieve close to asymptotic accuracy.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 37, May 2016, Pages 1–10
نویسندگان
, , , , , ,