کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
559030 875039 2013 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Adjusting dysarthric speech signals to be more intelligible
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
Adjusting dysarthric speech signals to be more intelligible
چکیده انگلیسی

This paper presents a system that transforms the speech signals of speakers with physical speech disabilities into a more intelligible form that can be more easily understood by listeners. These transformations are based on the correction of pronunciation errors by the removal of repeated sounds, the insertion of deleted sounds, the devoicing of unvoiced phonemes, the adjustment of the tempo of speech by phase vocoding, and the adjustment of the frequency characteristics of speech by anchor-based morphing of the spectrum. These transformations are based on observations of disabled articulation including improper glottal voicing, lessened tongue movement, and lessened energy produced by the lungs. This system is a substantial step towards full automation in speech transformation without the need for expert or clinical intervention.Among human listeners, recognition rates increased up to 191% (from 21.6% to 41.2%) relative to the original speech by using the module that corrects pronunciation errors. Several types of modified dysarthric speech signals are also supplied to a standard automatic speech recognition system. In that study, the proportion of words correctly recognized increased up to 121% (from 72.7% to 87.9%) relative to the original speech, across various parameterizations of the recognizer. This represents a significant advance towards human-to-human assistive communication software and human–computer interaction.


► Changes to dysarthric speech signals modify tempo, frequency, and phoneme sequences.
► Changes dramatically improve intelligibility of dysarthric speech by human listeners.
► Modified acoustics also improve accuracy with automatic speech recognition.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Speech & Language - Volume 27, Issue 6, September 2013, Pages 1163–1177
نویسندگان
,