کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
447401 | 1443137 | 2016 | 8 صفحه PDF | دانلود رایگان |
The performance of a non-tonal speech recognition system degrades when confronted with the task of recognizing tonal words. Several speech recognition applications require tonal word recognition. Therefore, this paper considers how to create a suitable tone model for a tonal syllable recognition system serving application devices based on a one-stream scheme. The fundamental frequency contour (F0 contour) approximation model is proposed here to estimate F0 continuity contours for all of a tonal word. The processes of approximation include voice detection, F0 smoothing, F0 forecasting, and F0 normalization. To model the F0 contours of unvoiced regions belonging to F0 forecasting, a linear regression function is used to create an approximate F0 contour. Experimental results indicate that the proposed model improves the accuracy of tonal word recognition by 8.6% and 12.2%, respectively, compared with conventional random and exponential approaches.
Journal: AEU - International Journal of Electronics and Communications - Volume 70, Issue 5, May 2016, Pages 681–688