کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
567084 | 876044 | 2013 | 9 صفحه PDF | دانلود رایگان |

• We propose an overlapped speech detection method in meetings.
• Each speaker wears a lapel microphone.
• Two novel features are utilized as inputs for a GMM-based detector.
• One is speech power after cross-channel spectral subtraction.
• The other is an amplitude spectral cosine correlation coefficient.
We propose an overlapped speech detection method for speech recognition and speaker diarization of meetings, where each speaker wears a lapel microphone. Two novel features are utilized as inputs for a GMM-based detector. One is speech power after cross-channel spectral subtraction which reduces the power from the other speakers. The other is an amplitude spectral cosine correlation coefficient which effectively extracts the correlation of spectral components in a rather quiet condition. We evaluated our method using a meeting speech corpus of four speakers. The accuracy of our proposed method, 75.7%, was significantly better than that of the conventional method, 66.8%, which uses raw speech power and power spectral Pearson’s correlation coefficient.
Journal: Speech Communication - Volume 55, Issue 10, November–December 2013, Pages 941–949