کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
565293 1452036 2014 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Vocal frequency estimation and voicing state prediction with surface EMG pattern recognition
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
Vocal frequency estimation and voicing state prediction with surface EMG pattern recognition
چکیده انگلیسی


• We use neck muscle EMG to model vocal fundamental frequency and voicing state.
• We use respiratory trace to model voicing state.
• We predict vocal fundamental frequency with an RMSE of 2.81 semitones.
• We predict voicing state with an accuracy of 78.05% using neck muscle EMG.
• We predict voicing state with an accuracy of 65.24% using respiratory trace.

The majority of laryngectomees use the electrolarynx as their primary mode of verbal communication after total laryngectomy surgery. However, the archetypal electrolarynx suffers from a monotonous tone and the inconvenience of requiring manual control. This paper presents the potential of pattern recognition to support electrolarynx use by predicting fundamental frequency (F0) and voicing state (VS) from surface EMG of the infrahyoid and suprahyoid muscles, as well as from a respiratory trace. In this study, surface EMG signals from the infrahyoid and suprahyoid muscle groups and respiratory trace were collected from 10 able-bodied, adult males (18–60 years old). Participants performed three kinds of vocal tasks – tones, legatos and phrases. Signal features were extracted from the EMG and respiratory trace, and a Support Vector Machine (SVM) classifier with radial basis function kernels was employed to predict F0 and voicing state. An average root mean squared error of 2.81 ± 0.6 semitones was achieved for the estimation of vocal frequency in the range of 90–360 Hz. An average cross-validation (CV) accuracy of 78.05 ± 6.3% was achieved for the prediction of voicing state from EMG and 65.24 ± 7.8% from the respiratory trace. The proposed method has the advantage of being non-invasive compared with studies that relied on intramuscular electrodes (invasive), while still maintaining an accuracy above chance. Pattern classification of neck-muscle surface EMG has merit in the prediction of fundamental frequency and voicing state during vocalization, encouraging further study of automatic pitch modulation for electrolarynges and silent speech interfaces.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Speech Communication - Volumes 63–64, September–October 2014, Pages 15–26
نویسندگان
, , ,