Unsupervised speaker segmentation with residual phase and MFCC features

Article ID	Journal	Published Year	Pages	File Type
388799	Expert Systems with Applications	2009	6 Pages	PDF

Abstract

This paper proposes an unsupervised method for improving the automatic speaker segmentation performance by combining the evidence from residual phase (RP) and mel frequency cepstral coefficients (MFCC). This method demonstrates the complementary nature of speaker specific information present in the residual phase in comparison with the information present in the conventional MFCC. Moreover this method presents an unsupervised speaker segmentation algorithm based on support vector machine (SVM). The experiments show that the combination of residual phase and MFCC helps to identify more robustly the transitions among speakers.

Keywords

Mel frequency cepstral coefficients Speaker segmentation Support vector machine