کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
4973482 | 1451641 | 2018 | 7 صفحه PDF | دانلود رایگان |
- First work to automatically detect glottal stop in cleft palate speech.
- Applying an extensive size of cleft palate speech database.
- An automatic initial consonants and finals segmentation method is proposed.
- Four acoustic features are applied to identify glottal stop
The speech therapy is essential during the whole treatment of Cleft Palate (CP) over many years. The automatic evaluation of CP speech could provide an effective assistant diagnosis. The glottal stop is a typical compensatory articulation error in CP speech. It is produced by adducting the vocal folds as a substitution during the pronunciation of oral pressure consonants. The existence of glottal stop has a great impact on speech intelligibility. In this work, an automatic glottal stop detection system is proposed. The CP speech database is collected by the Hospital of Stomatology, Sichuan University, which has the largest number of CP patients in China. This database includes extensive CP speech samples annotated by speech-language pathologists at the phoneme level. The vocabulary of this database includes all the initial consonants in Mandarin. The automatic initials and finals segmentation method is proposed firstly. Then, for the pressure initial consonants, four types of acoustic features are extracted: Mel-frequency cepstral coefficients, formants, Gammatone filtering energy feature, wavelet packet energy and Shannon entropy features. The features are processed using a k-nearest neighbor classifier. The k-fold cross validation is used to calculate the average detection accuracy, which reaches up to 93.5% using Gammatone filtering energy feature.
Journal: Biomedical Signal Processing and Control - Volume 39, January 2018, Pages 230-236