کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
558120 1451669 2014 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Pathological voice detection and binary classification using MPEG-7 audio features
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر پردازش سیگنال
پیش نمایش صفحه اول مقاله
Pathological voice detection and binary classification using MPEG-7 audio features
چکیده انگلیسی


• MPEG-7 audio descriptors are used for voice pathology detection/classification.
• Fisher ratio is applied to find the most discriminative features.
• Using only top five features the detection accuracy reached over 99%.
• Those five features obtained over 93% accuracy for binary classification.
• Achieved the best detection rate so far in MEEI database (subset).

ObjectivesA pathological voice detection and classification method based on MPEG-7 audio low-level features is proposed in this paper. MPEG-7 features are originally used for multimedia indexing, which includes both video and audio. Indexing is related to event detection, and as pathological voice is a separate event than normal voice, we show that MPEG-7 part-4 audio low-level features can do very well in detecting pathological voices, as well as binary classifying the pathologies.Patients and methodsThe experiments are done on a subset of sustained vowel (“AH”) recordings from healthy and voice pathological subjects, from the Massachusetts Eye and Ear Infirmary (MEEI) database. For classification, support vector machine (SVM) is applied. An optional feature selection method, namely, Fisher discrimination ratio is applied.ResultsThe proposed method with MPEG-7 audio features and SVM classification is evaluated on voice pathology detection, as well as binary pathologies classification. The proposed method is able to achieve an accuracy of 99.994% with a standard deviation of 0.0105% for detecting pathological voices and an accuracy up to 100% for binary pathologies classification.ConclusionMPEG-7 descriptors can reliably be used for automatic voice pathology detection and classification.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Biomedical Signal Processing and Control - Volume 11, May 2014, Pages 1–9
نویسندگان
, ,