کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
535248 870334 2007 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Segmentation of specific speech signals from multi-dialog environment using SVM and wavelet
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Segmentation of specific speech signals from multi-dialog environment using SVM and wavelet
چکیده انگلیسی

In this paper, a novel multi-speaker segmentation technique is presented. This technique makes use of wavelets and support vector machines (SVMs) to segment specific speakers’ speech signals from multi-dialog environments. The proposed method first applies wavelets to determine the acoustical features such as subband power and pitch information from a given multi-dialog speech data. Then the multi-speaker segmentation of the given multi-dialog speech data can be accomplished by the use of a bottom–up SVM over these acoustical features and additional parameters, such as frequency cepstral coefficients. A public audio database, Aurora-2, is used to evaluate the performances of the proposed method. Experimental results show that the accuracy of multi-speaker segmentation is 100% achieved in the combination of two speakers. And the segmental accuracy can achieve at least 94.12% and 85.93% for 4-speaker and 8-speaker conditions, respectively.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 28, Issue 11, 1 August 2007, Pages 1307–1313
نویسندگان
, , ,