Singing speaker clustering based on subspace learning in the GMM mean supervector space

Article ID	Journal	Published Year	Pages	File Type
10370352	Speech Communication	2013	14 Pages	PDF

Abstract

► Mixed-style speech causes problems when training acoustic models for speech applications such as speaker ID and ASR.
► This study is a first attempt at speaker clustering under mixed speaking styles that include reading and singing.
► Two types of subspace learning strategies in the GMM mean supervector space are studied: unsupervised and supervised.
► Advanced clustering algorithms are evaluated on a database in which each speaker both reads and sings the lyrics.
► LPP subspace learning and a proposed PLDA-based cluster refinement significantly improve clustering accuracy.

Keywords

Singing; Speaker clustering; Speaking styles; Subspace learning